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ABSTRACT 


The STUDENT problem Eolving system, progeanmad in, LISP, ac¬ 
cepts as input a comfortable but restricted subset of English which 
car oKptoSs a wide variety of algebra story problems. STUDENT findE 
the Solution to a large class of these problems, STUDENT can utilize 
a store of global information not Specific to any one problem, and 
may make assumptions about the interpretation of ambiguities in the 
wording of the problem being solved + If it uses such information, 

Of moke* any assumptions, STUDENT Communicate e this fact to the user. 

The thesis includes a s^m^natry of other English language ques¬ 
tion-answering systems. All these Systems, and STUDENT, ere evalu- 
ated according to four standard criteria, 

The linguistic analysis in STUDENT is a first approximation 
to the analytic portion of a semantic theory of discourse outlined 
In the thesis. STUDENT finds the set of kernel sentences which -are 
the base of the input diEcaurse, and transforms this sequence of 
kernel sentences into a set of simuitaneouE equations which form the 
semantic base of the STUDENT system, STUDENT then tries to solve 
thie Set of equations for the values of requested unknowns. If it 
Is Successful it gives the answers in English. If not, STUDENT asks 
the user for core informations and indicates the nature of the de¬ 
sired information. The STUDENT system is o first step toward natu¬ 
ral language conr.uuicat ion with computers. Further work on the se¬ 
mantic theory proposed should result in much mere sophistiested 
systems. 
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tacAPTHK I: ^tkioe)[:c:tion 


The aira of the research reported here wis to discover how 
One could build a computer program which too Id eommuni c a t e with 
people in a natural language within Some restricted problem domain. 

In the course of this ioveet lotion, t wrote a sot- of computer pro- 
grants s the STUDENT iyetcnij which accepts as input a coir.f or table but 
restricted subset of English which can be used to express a wide 
variety of algebra Story problemsi The problems shown in Figure L 
i Host rate some of the comnurtlcatIon and problem solving capabil- 
idea of this system. 

In the following discussion, I shall use phrases such a a 
"the computer understands English 11 ! In all such cases* the "En¬ 
glish 1 ' is Just the restricted subset of English which is allowable 
as input for the computer program under discussion. In addition, 
for purposes of this report I have adopted the following operational 
definition of understanding. A computer understands a subset of En¬ 
glish if it accepts input sentences which are members of this subset, 
and answers questions based on infomat ion contained In the input. 

The STUDENT system understands English in this sensei 


A. The Problem Context of the STUDENT System. 

In constructing a question-answering system, many problems 
are greatly simplified if the problem context is restricted. The 
aimpiification result log from the restrictions embodied in the STU¬ 
DENT system, and the reasons these sirapliflost tons arise, will bo 
discussed in detail in the body of this report. 

The STUDENT system is designed to answer questions embedded 
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in English language statements of algebra story problems such a a 
those sli own in Figure 1 ■ STUDENT does this by const rue ting from 
the English input a Corresponding set of algebraic equations * and 
solving this act of equations for the requested unknowns- If 
needed, STUDENT has. access to a store of "global" information* 
not specific, to any particular problem, and can retrieve relevant 
facts and equations from this store of information. STUDENT com¬ 
ments on its progress in solving a problem, and can request the 
help of the questioner if it gets Stuck. 

There are a number of reasons why I chose the context of 
algebra story problems in which to develop techniques which would 
allow a computer problem solving system to accept natural Language 
input + First, we know a good type of data atrueCure in which to 
Store information needed Co answer questions in this context, 
namely, algebraic equation*. There exist will known algorithms 
for deducing information implicit in the equations, that ia* 
values fer particular variables which satisfy the Set of equations. 

Tn addition, I felt that there was a manageable subset of 
English in which many types of algebra story problems wore en- 
prossible. A. Large number of these story problems are available 
:n first year high school text books, and. 1 have transcribed, ^emve 
of them into STUDENT** input English. Since this cuescion-answer¬ 
ing CaBk ia one perfunned by humans, and since the entire process 
from input to solution of the equations was programmed, wc can ob¬ 
tain a measure of c ompa r is on between Che performance of STUDENT 
and of a human on Che same problems. In fact, this program on an 
UK 7094 answers most questions that it can handle as fast or 
faster than humans trying the same problem. Tn judging Ch is com¬ 
parison, one should, remember the base a peed of the IfiM 7094, which 
■can perform over one hundred thousand additions per second. 
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B+ Reasons for Wanting. Watmal Language Input, 

Why should one want to talk to a computer in Eng Lis tit There 
ate many tongues the computer already understands - auc.li as FORTRAN, 
CQMIT, LISP, ALGOL, COBOL, Co name just a few. These serve ade¬ 
quately as comunlcattcn media with the computer for a Large class 
of problems, A irorc pertinent question is really* when £s Engl 1 ah 
input to a computer desirable? 

English input ie desirable* for example, if it is neces sary 
to uae the computer for retrieveL of Lnfoduation from fi text In 
English. If a computer could accept English input, much information 
now recorded only in Engliah would be available for computer u6c 
without need for human tranalatlon. 

A computer which understood English would be more accessible 
to any speaker of English* whether or not he waa trained in any 
"foreign" computer tongue* For e single shot at the computer with 
a question not likely to be repeated, It would not be worthwhile 
to train the ubst in a spec 1$1lied language- For fact retrieval, 
rather than document retrieval* English is a good vehicle for 
Stating queries* For a good description of the differences between 
fact and document retrieval, see Ce-Oper (1£)* 

Programming languages are process oriented, fine cannot 
describe S problem, only a method for finding a solution to the prob¬ 
lem* A natural language is a convenient vehicle for providing a 
description of the problem itself* leaving the choice of processing 
to the problem Eolver accepting the input* In an extreme case, one 
woutd like to talk to tbe computer about a problem* with appropriate 
questions and interjections by the computer on assumptions it finds 
necessary, until the computer claims that the probLem is now well 
formed* and an attempt at solution can be made* 
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Finally, man 1 s ability to use symbols and language is *■ prime 
factor in his Intelligence, and if ve can learn how to make a com¬ 
puter understand a natural language, we will have tsktrt .-j big step 
toward creating an "artifinis 3ly intelligent" computet (32). 


G- Criteria for Evaluating Quest ion-Answerinft Systems, 

We have de-fined understanding in terms of an ability to An¬ 
swer questions in English. A numbe r of quest ion- answer ing systems 
have boon built, and will be described in the next section* In this 
section, we shall give a number of criteria for evaluating question- 
answu r ing s ys t ems . 

In many systems there is e separation of data Input and ques¬ 
tion input. For all Sy-stens under consider at i on* the Input questions 
are In English* The input data may be cither in English or in a 
preetructuted format, e.g. a tree or hierarchy. The English data 
input may bo used as a data base as is, or mapped into a structured 
information Store, 5 noons,, in his Competent survey English ques¬ 
tion-answering systems <-40) * calls those systems using a structured 
information store "data base quest ion-anewererg”, as opposed to 
"text-based quest ion-answerers" which retrieve facts from the original 
text, 


The extent of understanding of a quest ion-answering system 
can be measured along three different dimensions, syntactic, seman¬ 
tic and deductive, Along the syntactic dimension one can measure 
the grammatical complexity allowable in input sentences* This may 
differ for Che data input and question input. In the simplest case, 
one or some small number of fixed format sentences are allowable in¬ 
puts. Less restricted inputs may allow any sentences which can be 
parsed by a fixe3 gr^uim.:!r. The nearer this gramar is to ^ granular 
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of ail of English, the less restricted Is the input» Because text- 
based question'answerers accept as Input any String of words, with¬ 
out further processing they have no syntactic limitation on input. 
However, the fact-retrieval program may only be able to abstract 
information from those portions of a text with less than some maxi- 
muo syntactic complexity. 

In d ata base question-answering systems, only certain rela¬ 
tionships between words, or objecta, may he tepresentable in the 
information story. Other information may be discarded or ignored. 
This is a limitation In the semantic dimension of understanding. 

In order to obtain fifitfwers to questions not explicitly given 
in the input, a question-answering system must have the power to per¬ 
form some deductions* The structure of the informafcIon store may 
facilitate such deductive ability* The range of deductive ability 
is measured along the deductive dimension of understanding. The 
structure of the information Store may also aid in selecting only 
relevant material for use in the deductive quastiorti-answering pro¬ 
cess, thus improving the efficiency of the system. 

Another criteria closely related to the extent ef under- 
standing, is the facility with which the syntactic. Semantic* or 
deductive abilities of a quest ion-answering system can be extended, 
in the beet case one could improve the system nlong any dimension 
by teIking to it in English. Alternatively f one might have to add 
some new programs to the system, or at worst* any change might imply 
complete reprogramming of the entire system* 

An important additional consideration for users of a ques¬ 
tion-answering system is the amount of knowledge of the internal 
structure of the system that is necessary to use It, At best one 
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need not be aware of the infonaaC ton storage struct Lire used at all. 

At worst , a thorough knowledge of the internal structure m;iy be nec¬ 
essary to construct suitable input. 

Another measure of the usefulness of a quest!on-3nswering 
system is its ability to interact with the user. In the worst esse, 
a question is aaked and Sometime later an answer or report of fail¬ 
ure is given, When the question cannot be answered, no indication is 
given of the cause of failure, nor does the system allow the person 
to give any help. This is typical of the operation of a number of 
Air Force query systems (Jay Keyser* personal coirmunicHtiori) . In 
the best erne, the system will aak the user for specific help and 
accept suggestions of appropriate courses of action. 

In this section we have given four criteria for evaluating 
quest ton-answering systems, they may he summarised as follows: 

1) Extent of understanding (syntactic, semantic and de¬ 
ductive abilities) 

£) Facility for extending abilities (syntactic, semantic., 
d ed uctive) 

3) Need by user for knowledge of internal structure of 
system 

4) Extent of interaction with user 


D- English Language Quest ion-Answering Systems, 

In this section.j I shall give a critical eunnwry of a number 
Of English Language quest ion- answering systems, utilising the eri^ 
terie outlined in the previous section. This discussion will provide 
a context for the section, of the concluding chapter which suTicnsr licea 
the capabilities of the 5TUDE1TT system. For a description of the dif¬ 
ferent syntactic analysis schemes mentioned below, see the survey by 
lUibr-ew (4) * 
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1) Pblllitie. Gne of the cor licit quoStion-answering systems 
was written in I960 at HIT by .Anthony Phillips (35). It is a data 
base system which accepts sentences which can be parsed by a very 
simple context- free phrase structure gracmar, of the type defined by 
Chomsky C8J+ Additional syntactic restrict Ions require that each 
word must be in only one grammatical class* and that a sentence has 
exactly one parsing* 

A parsed sentence Is transformed Into a list of five ele¬ 
ments , the subject, verb* object, time phrase* and place phrase in 
the sentencei All other information in the sentence is disregarded. 
Questions are answered by matching the list from the transformed 
question against the list for each input sentence+ When a match is 
found, the corresponding sentence is given as an answer* 

Phillips f system ha@ no deductive ability and adding new 
abilities would require reprogr aimnlng the syEten+ A questioner must 
be aware that the system utility a matching process which does not 
recognize synonyms, and therefore the sentence "The teechet cats 
lunch St wn," will not be recognized as an answer to the question 
"What docs the teacher do at twelve o 1 clock? 11 When Phillips' system 
cannot find an answer, it reports only "(THE (JHACLE DOES NOT KNOW)"* 

It provides for no further interaction with the user. 

2) Green, Baseball is a question-answering system designed 
and programmed at Lincoln Laboratories by Green, Wolf, Chomsky and 
Laughety (19)+ It Is a date baae system, in which the data is placed 
in memory in a prestructured tree format. The data consist^ of the 
dates > location, opposing team* and scores of some American League 
baseball games. Only questions to the system can bo given in English, 
not the data. 
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Questions [Tills c be simple sentences , with no relative 
clauses, logical ot coordinate connec t tv c t * With these restrictions* 
the program will accept any question Couthcd in words contained in 
a vocabulary list quite adequate for asking questions shout base- 
hs11 st*tistice. In addition, the parsing routine, based on tech¬ 
niques deveLoped by Harris {21),, must find a parsing for the question 

The questions must pertain to statistics a bent baseball, 

Kamos found In the infomation store* One cannot ask questions 
about extrema, such as '''highest" SCOte or "fewest" number of genres 
won* The parsed question is transformed into a standard specifica- 
tion {or spec) list, and the question-onawexing routine utilises 
this canonical form for the meaning of the question* fur example, 
the question r, Vho beat the Yankees on July 4?" would bo transformed 
into the "spec list": 

Team (logins) * New York 
Team (winning)" ? 

Hate - July 4 

Because Baseball does not utilize English tor data input, we 
cannot talk about deductions made from information implicit in sev¬ 
eral sentences♦ However, Baseball can perform operations such as 
counting (the number of games played by Boston, for enable) jind 
thuE in the Sense that it is utilizing several separate data units 
in its store, it is performing deductions* 

Baseball’s abilities can only be extended by extensive re- 
programming, though the techniques utilised h»ve some genera] appli¬ 
cability. Because the parsing program hag * very complete Rrsrmar, 
and the vocabulary list is quite comprehensive for the problem domain 
the user needs no knowledge of the internal structure of the Base¬ 
ball program.. No provision for interaction with the user was made* 


3) S immcna , The SYNTH EX system is a text-baaed question-an~ 
swering ayatem designed and progfrtErvad at SDC by Simmons* Klein and 
HcCufiologne (41). The entire contents of a children's encyclopedia 
has been transcribed to magnetic tape for use as the information 
store* An index haa been prepared listing the location of all the 
content words in the text, i.e r including words like ’’worm*" "eat/' 
and ''birds/' while excluding function words like "and/' 11 the/' and 
"o£/ r All the content words of a question ore extracted, and in¬ 
formation rich sections of the text are retrieved, i.c* sections 
that are locally dense in content words contained in the question* 

For example, if the question were “What do worms eat?", with 
content words “worms" and "eat 11 ) the two sentences "BiTdE eat worms 
on the grass*" and "Most worms usually eat grass." might be retrieved. 
At this time, the program performs a syntactic analyeis of the ques¬ 
tion and of the sentences that may contain the answer. A comparison 
of the dependency trees of the question and various sentences may 
eliminate aome irrelevant sentences + In the example, "Sards eat 
worms on the grass" is eliminated because ''worms" is the object of 
the verb "eats'" instead of the subject os £n the question* In the 
general case, the retaining sentence® are given in some ranked order 
as possibly answering the question* 

SYM'fHKX is limited syntacticslly hy its grammar to the ex¬ 
tent that the syntactic analysis eliminates Irrelevant statements. 

It makes no use of the meaning of any Statements or words t and cannot 
deduce answers from information implicit in two or more sentences. 
Because the grammar is independent of the program, the syntactic 
ability of SYNthKX can be extended relatively c3®SJy* However, be¬ 
fore it can become a good question-answering system, some semantic 
abilities will h ave to be added* 


SY1THEX. docs not explicitly provide for interaction with the 



user, but betause It is implemented in the SDC time-sharing system 
(9) 5 a user may modify a previous question if the sentences re¬ 
trieved ire re not suitable. The mechanism for selection of sentences 
must be kept in mind to get best results. 

d) Lindsay. While at the Carnegie Institute o£ Technology, 
Robert Li-ndi-iy (2&) programmed the SAD SAM question-answering system. 
The input to the ByBtem is s set of sentences in Ensic English ± a 
subset of English devised by C + K + Ogden (35)> which has a vocabulary 
of about J5QC words and s simple subset of the full English gram- 
Tiuit. The SAD part ($yiitactic Appraiser and ft1agrummer) of $Aft SAM 
p-irses the sentence ofling A predictive analysis scheme. The Seman¬ 
tic Analysing Machine (SAM.) ex tract a from these parsed a ententes 
information about the family relational! lpe of people mentioned; it 
stores this inforoatlon on a computer representation of the family 
tree* and ignores all other information in the sentence. For example, 
from the parsing of "Ton, Mary 1 a brother> vent to the state." Lind¬ 
say's program would extract the sibling relationship of Tom and Mary* 
place them on the family tree as descendants of the Same mother and 
father, and ignore the information about where Tom wont. 

The information storage structure ntiliz-ed by SAD $AM, namely, 
the family tree, facilitates deduct ions from information implicit 
in many sentences- Because a family relationship is defined in 
terms of the relative position (no pun intended) of two people in 
their family tree, computition of the relationship is independent 
f£ the number of sentences required to place in the tree the path 
between the individual a. 

Extending the abilities of the SAD SAM system would require 
reprogramming. Mo provision is made for interaction with the user* 

HP internal knowledge of the program structure is necessary if the 
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user restricts his queries to questions of family relationships, and 
his language to Basic English. 

5) Raphael. The SIR question-answering system (mnemonic 
for Semantic Information Retrieval) was designed by Bertram Raphael 
(3ft) at MIT* The SIR system accepts simple sentences in any of 
about ZO fixed formats useful for expressing certain relationships 
between objects, Ilia semantic relationships extracted from these 
sentences are those of set membership, set inclusion, subpert, leEt- 
tb-right position and ownership. 

The information about the re1stionshlps between various ob¬ 
jects is stored in a semantic network, where the nodes of the net¬ 
work arc objects and the relationships are indicated by directed 
labeled links between nodes. For example. If the three sentences 
"John is a boy,," "A boy is a person," and "Two hands are part of 
any person" were an input to SIR, four nodes labeled, John , boy , 
person and hand would be created. Included in the network would be 
a link indicating. Eet membership between John and buy, another with 
a label indicating set inclusion between boy and person, and 4 Unk 
Indicating hand is a subpart of person, with the number o£ purts equal, 
to 2 + 


Separate question-anSW*ring routine* are used foT questions 
involying different relationships- Each routine takes cognizance 
of the interaction of various relationships, and cstt deduce answers 
from the linked structure of the network, independent of the number 
of sentences which were necessary to set up these links. For exam¬ 
ple, by tracing the links from "John" to "hand," SIR would answer 
"TES" 1 to the question "Is 3 hand part of John?" 


The SIR system can interact with the user- For example, if 






told that "A finger is port of a band 11 and a eked "how many fingers 
dots John have? 11 it would reply "How many finger a per hand?'* Then 
if it is told "Every hand has five fingers,” it would answer the 
question with '"The answer is 10". 

Any extensions of the SIR system necessitate additional pro¬ 
gramming effort, though it ia considerably easier to add new syntac¬ 
tic forms than new semantic relationships. Within the input limits 
of the 20 fined format statements, the user need not know anything 
of the internal structure of the informetinn storage structure. 


E « Other Related Work. 

In addition to those question-anewer Jug systems described 
ahove, a number of programs have been written to translate English 
Statements into a logical notation to check the consistency of a set 
of statements, and the validity of logical Otgunwjnts. In the sense 
thati given a corpus transformed to some logical notation, and another 
statement, a logic-based system can answer the question "Is this 
statement (or its negation) implied by the corpus?", such logic- 
based system# are qn(Stloc-answering systems. 

Cooper (.12) a-pd Darlington (14) both have programs which 
trimslato 4 Subset of English Into the propositional calculus. Dar¬ 
lington is also working On programs which can translate English into 
the first order and second order predicate calculi. A difficult prob¬ 
lem being considered by Darlington, in trying to handle implications 
of English statements in terms of their logical translation, is the 
determination of the proper level of analysis For a particular prob¬ 
lem - that ia, whether to translate the input ihtb second order 
predicate calculus whore proofs are very difficult, or to try to 
use first order predicate or propositional calculus to prove the 
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theorem, and perhaps find it Logically insufficient. 


At the National Bureau of Standards, Kirach (22), Cohen CiQ) 
and Sillars (39) have designed a system iti which pictures and English 
language statements are converted to expressions in the first order 
predicate calculus. One can then check to see if an English language 
statement Is conslatent with a given picture* 

McCarthy's Advice-Taker <3U}> though not designed to accept 
English input, would make an excellent Base for a quest ion-answering 
system. Fischer Block (2) has programmed a system which can do all 
of McCarthy's Advice-Taket problems, and esn be adapted to accept 0 
very limited subset of English. The deductive system in Black 1 5 
program is equivalent to tne propositional calculus. 

A number of people hove done work bearing directly oh the 
problem of solving algebra word problems stated in English. Sylvia 
Garfinkle (IS) wrote s paper in which she described the heuristics 
she would use in programming a computet to solve algebra word prob¬ 
lems, but never wrote the program. Most, of the heuristics were too 
vague to really be used; e.g. just stating that one should Identify 
two variables' names which are only slightly different, hut giving 
no good criteria for a slight difference, The treatment of "thiE 11 was 
taken from Garf inkle^s paper. So were s number of simplified state¬ 
ments of algebra story problems she transcribed and transformed from 
problems in a first year algebra text book* 

Michael Colaoan (11), at HIT, wrote a term paper describing 
a program of hi9 which sets up the equations for some types of alge¬ 
bra stc-ry problems (also handled by STUDENT). Some of the special 
heuristics I use for "age problems 11 were inspired by techniques he 
invented, 


£0 


In his theaia* David Kuck (24) describee hifi ideas on how Co 
construct this type of program, but again did not implement these ideas* 
He suggests netheds for transformerion of English input to equations 
which would require much more information about worda than is used 
in the STUDENT program, end therefore were not applicable in this work. 
The STUDENT progrsn considers worda aa symbols, and nakea do with 
ee Little knowledge ahout the meaning of word#; jp corapat ihle 
<*ith the goal of finding a aolution to the particular problem* 
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CHAFTER g; SEMANTIC GENERATION AND ANALYSIS C1F DISCOURSE 


The purpose of this chapter is to put the techniques of analy¬ 
sis embedded in the STUDENT program into a wider contest* and indi¬ 
cate how they would fit into a more general Language processing sys¬ 
tem, We will describe in this chapter a theory of semantic genera¬ 
tion and ana Lysis of discourse. STUDENT can then be considered a 
first approximation to a computer implementation of the analytic 
portion of the theoryj 'With certain restrict iona on the interpreta¬ 
tion of a discourse to be analyzed. It will be evident from the theo¬ 
ry why apelyeis is so greatly simplified by the imposed restrictions- 


A- Language as Cougwnicatian. 

Language is an encoding used for conmunicstion between a 
Speaker and a listener (or writer and reader). To transmit an 
"idea*", the speaker must first encode it in a message, as a string 
in the transmission language- In order to understand this message, 
a listener must decode it, and extract its moaning. The coding o: a 
particular message, M, is a function of both its global context and 
local context. The global context of a message ie the background 
knowledge of the speaker and the listener, including some knowledge 
of possible universes of discourse, and codings for some simple ideas, 

The local context of a message, M, is the act of messages tem¬ 
porally adjacent to H. X may refer hack to earlier messages. M nay 
even he just a modification of a previous message, and only under¬ 
standable in this context. For example, consider the second sen¬ 
tence of the following discourse; "How many chaplains are in the 
U.S, Army? How many arc in the navy?*' 

In order for communication to take place, the in£urination map 
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o£ both the listener and the speaker must be approximate!y the seme, 
at least fot the universe of -d la course j, also the decoding process 
Of the Listener must be in approximate inverse of the- encoding ptocess 
Of Che Speaker. Education in language is, in large part* an attempt 
to force the language processora of different people into 0 uniform 
ciold to facilitate successful communication.. We arc not proposing 
Chat identity in detail is achieved, but as Quine 80 nicely put it 
(37); 


"Different pereons growing up In the same language are 
like different hushes trimmed a n d trained to take the shape 
of identical elephants. The anatomical details of twigs and 
branches will fulfill Che elephantine form differently from 
bush to huohi hut the overall outward results are alike*"' 

As a speaker transmits successive messages concerning Seme 
portion of hi* information map, the listener who understands the mes¬ 
sages constructs a model of a "situation"*. The relation between the 
listener's model and the speaker's information map is that from each 
can he extracted the transmitted information relevant to the universe 
of discourse, including information deducible from the entire set 
of messages, Tire internal structure of the listener's Bsadel need 
bear no resemblance to that of the Speaker, end may in general con¬ 
tain far less detail. 


B. Theories of Lacsuase. 

According to Morris 1 theory of signs (33), the encoding and 
decoding of language nan ho stratified into three levels. Hie first 
Level is the eyfttactic which dealE with the relationships of signs 
to other signs. A syntactic analysis, treating word a aa meinhe r S □£ 
classes of words, can yield structurings of messages which indicate 
common processing features. The second level, aeinant 1c analysis, is 
concerned with the relationshipa of Signs to the thinga they denote. 
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A third Levels, £rjjgaatlc analysis, is concerned with the relationships 
between signs and their interpretations In terms cf actions required* 
Our theory will deal with all three Levels of analysis, with a pri¬ 
mary emphasis on the relation of the semantic aspect of language to 
the generation of discourse,, 

Many theories of syntax hay* been developed to describe the 
Structure of English, end m^ny of these have served os bases for 
computer programs -which perform syntactic analysis* For a complete 
survey of such systems sec the paper by Bobrow (4), .Almost all 
of these theories ignore the concepts of meaning and semdnt i CK* Be¬ 
cause they ignore such an important aspect of lenguiig.es, programs based 
on Such theories often yield many passible structurings Tor a single 
sentence which is unambiguous to a person* With some use of meaning, 
many of the meaningless SwibigUOtlE interpret at ions could be eliminated. 
For a good discussion of why ambiguities arise in syntactic analysis 
see Kuno and frettinger <25). 

Based on some ideas described by Yngve (46), a number of 
programs have been written which generate syntactically correct En¬ 
glish sentences. In most cases, the sentences generated are pre¬ 
dominately meanInsleaa nonsense* The coherent discourse generator of 
Klein <25) is the one exception I knew. Klein utilizes an Input test 
from which he extracta certain structural dependencies of the words 
in tire input. He then generates sentences and before they arc re¬ 
leased for output, a postprocessor checks to see if the words in the 
generated sentence satisfy structural dependencies consistent with 
those found in the input test. However, even in Klein's program no 
attempt is made to use the denotive Denning of any word „ except in 
so far as this meaning is reflected In its cooccurrences with otErer 
words in the input text. 
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Some theoiri.cS which do consider the problem of secn-:intics are 
hoing developed hok, Fendegraft <27} states that the programs being 
developed at the Linguistic hesearch Center of the University of Tex¬ 
as are an explication of Morris f theory of signs. Though not yet 
implemented, the semantic -analysis program will maStc use of a pre¬ 
liminary phrase Structure syntactic analysis. A number of syntactic 
Structures, with appropriate vocabulary items, will map onto single 
semantic constants, essentially indicating that these structures all 
have the asme meaning. This gives a type of canonical form for 
structures in terms of their mcbnings, but does not utilise any ex¬ 
plicit model of the world. No provision is made in the theory for 
deduction of information implicit in a set of sentences. 

Lamb (26) ai BQ has proposed a strat f f Uationa 1 theory of gram¬ 
ma not yet implemented on a computer, in which successive levels of 
analysis are performed, with a final mapping of the input into struc¬ 
tures in a "iememic" stratum of the language* In this sememic stra¬ 
tum arc bundles of "sememes" or meanings * end indications of the re- 
la tionships between different bundles. Different sentences which 
mean the same thing should map into the sane structure tn this semcmic 
stratum. Sememic structures are thus canonical representations of 
meaning. 


C. Definition of Coherent blscoutse. 

The theory of language generation and analysis which we shall 
describe below is designed to handle what we call coherent discourse. 
A discourse is a sequence of sentences such that the meaning of the 
discourse cannot be determined by interpreting each sentence inde¬ 
pendently, disregarding the other sentences in the discourse. The 
interpretation of each sentence wiy be dependent on the locol con¬ 
text > in the sense defined previously, A discourse is coherent if 
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it tvftS a complete and consistent interpret at ion* Completeness im¬ 
plies that there ii no substring within the discourse that does not 
have some interpretation in the model of the situation being built 
by the listener, 

A listener's ability to build £ model of a situation from a 
discourse is dependent on information available to him from his gen¬ 
eral store of knowledge, Therefore it is quite possible for a dis¬ 
course to seeni coherent to one listener and not soother* A writer, 
reading his own writing, may feel that ho has generated a coherent 
Sequence of sentences, but in fact, it is incoherent to ail otheT 
readers, this is, unfortunately, not a tare occurrence in the sci¬ 
entific literature. Conversely, a listener Who is £ psyehietrist, 
for example, may find coherence in a sequence of remarks which a 
patient h h <nks are entirely unrelated. 

The STUDENT system utilizes an expandable store of general 
knowledge to build a model of £ situation described in a member of 
a limited class of discourses, Hie form of this model of a situation 
built by STUDENT will be discussed in detail in a later section of 
this chapter. As far as I know, STUDENT ia the only computer im¬ 
plement £tion of e theory of discourse analysts now extant that maps 
a discourse into some representation of its meaning* When the theo¬ 
ries of Lamb and Fendegraft are implementedj they should siso be 
able to -analyse this class of discourse (and others) * Harris also 
talks about "discourse analysis," (20) but in tils use of this term 
be specifically excludes the use of meaning,. Stating: 

"The method (of discourse analysts] is formal, depending 
only on the occurrence of morphemes as disL Lngniehsble ele¬ 
ments, and not upon the analyst's knowledge of the particular 
meaning of each morpherne*" 
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D, The Use of Kernel Sentences in Our Iheory* 

A basic postulate of our theory of language analysis is that 
G listener understands a discourse by transforming it into an equi¬ 
valent (in meaning) sequence of simpler kernel Sentences. A kernel, 
sentence is one which the listener can understand directly; that 
is, one for which he knows a transformation Into his information: 
store. Conversely, a speaker generates a set o£ kernel sentences 
from his information map, end utilises s sequence of transformations 
on this set to yield his spoken discourse* This set of kernel sen¬ 
tences is not invariant from person to person, and even varies for a 
single individual as he learns. 

The use of kernel Sentences in this way is controversial* 
lEoweveri the theory ie proposed as a good framework for understanding 
and leap lemntlt ing language processing on a computer, not necessarily 
as a wudel for human behaviour, 'Ere usefulness of tki# theory ns a 
psychological model is an empirical question* Skinner (42) ha5 
given some psychologtea 1 justification for assuming the existence of 
a set of base Sentences, and Chomsky (7) has discussed the linguis¬ 
tic merits of the use Of the concept of kernel sentences* Despite 
this cniKirton concept of kernel sentences, in practice* Our u#e of 
kernel sentences is different than that of Skinner or Chomsky* 'Our 
use of kernel Sentences as a basis of a language la analogous to the 
use of generators in defining a group. 

Although we are not proposing opr theory as a basis for a psy¬ 
chological model, it has been useful, to avoid Circumlocutions* to 
describe the theory in terms o-f the properties and actions of s hypo¬ 
thetical speaker and listener, All statements about speakers and 
listeners should be interpreted as referring to computer programs 
which respectively, generate and analyze coherent discourse. 
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£. Generation of Coherent Discourse 


1) The Spe-Jker 1 * Model of the World* We assume that a 
speaker has some model of the world in his information store* We 
shall not he concerned here with how this model was built, or its CX-- 
act form* Different forms for the rao-del will be useful for different 
language tasks* hut they must all have the properties described below. 


The basic component* of the model ate a set of objects, ^|j 
s set o£ functions jh* set of relations I^R'^ J b a sot of pre- 

and a set of semantic deductive rules* A function 


positions 


M • 


is a mapping from ordered Sots of n objects b called the argu¬ 
ments of T 1 / » into the *et of objects* The mapping: may be multi¬ 
valued and is defined only if the arguments satisfy 0 Set of Con¬ 


ditions associated with F 


,n 


A condition is essentially membership 


In a class of objects, hut is defined more precisely below* A to- 
n 

la Line: is a special type of object in the model, and consists 

of a label (a unique identifier ) y and an ordered set of n conditions, 
called the argument conditions for the relation. Functions of re¬ 
lations are again relatione * 


An elcmcntdry proposition consists of a label associated with 
some relation, h!' , 3 nd so, ordered set of n. objects satisfying the 

L 

argument conditions for this relation, fine taay think of these pro- 
positions as the beliefs of a speaker about what relationships be¬ 
tween objects he has noticed ate true tn the. world* CompleK prn- 
positIons are logical combinations (in the usual sense) of elementary 
propositions * 

The semantic deductive rules give procedures for adding new 
propositons to the model based on the propbSitionS now in the model* 

Tn addition to the ordinary rules 0-f logic* thus* rules include axioms 
about the reletionship# of the relations in the model, The semantic 
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deductive rules also include lints Co the senses of the speaker* For 
example, one such deductive Tula for adding a propositon to the ncdcl 
Blight be (loosely speaking) "Look in the real world and see if if is 
true." These rules essentially determine how the model is to he ex¬ 
panded, and are the most complex part of a complete System. How™ 
ever, from our present point of vtev» wc need only eons ider these 
rules as a black box which can extend the set of propositions in the 
mode1. 


A closed question is a relational label for some 1!^ and an 
ordered set of n objects. The answer to this question is affirmative 
if the proposition, consisting of this label and the n objects,, is 
in the esodol (or can be added to it). If the negation of this pro¬ 
position is in the model (ot Can be added), Che answer is negative. 
OtEserwiso the answer is undefined. 

An open question consists of a relational label for an n-argu™ 
ment relation, R™ , and a Stft of objects cor respond ing to n-k i>£ these 
arguments, where nfeklL , An answer to an Open question is an or¬ 
dered set of k objects. Such thet if these objects are associated 
with thfc k unspecified .irgumcnts of R^ , the resulting proposition is 
iti ths cncdel or can be added to it, An open question may have no 
answers, or may have one or more answers. A condition is an open 
question with k=l, and an object satisfies a condition if it is an 
answer to the question, 

2) Oeaiet-afcEon of Kernel Rentcnees. He have described the 
logical properties of the speaker r s model of the world. We shall 
now consider how strings in e Language,, words, phrases* and sentences, 
are associated with the model. Corresponding to Che set of objects 
0 £ there is a set of strings (in English in our case), 

called the names of the objects* There is a many-one mapping from 
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{"*j} onto (°i3- V Is many-one beta use one object may have more 
than one name, e,g, frankfurter and hot dog both map back into the 
Bane object,in the model. 

Recall that functions map n-Euples of objects into objects, 
^luie a function name and an ti-tuple can specify an object. We 
can derive a name for this object from the function name and the 
nacaea of its tl orjjmEicTitfli■ Associated with each function ia at 
least one linguistic form, a string of worda with blttnk* in which 
names of arguments of the function must be inserted r Examples of 
linguistic forms associated with a model are "number of "i 

"father of_ ", and "the child of _ end. " + There is 

a many-one mapping from the set of linguistic forma fe} onto the 
set of functions. Two examples of multiple Linguistic form* for 

the same function are: "father of_" and " __ V s father"; 

and " plus _^_ w and "the sum of _ and_V. Thus^ 

if objects x and y have names "the first number" and "the second 
number 11 and associated with the function " * " is the linguist fc 

form "the product of_ and M * then the name of the object 

produced by applying the function 11 * " to x and y ia "the product 
of the first number and the second number"+ A parsing of a name 
thus must decompose it into the part which Is the linguistic fort:, 
and the parts which arc tiaenea of arguments of the corresponding func¬ 
tion. We shall call objects defined in terms of a function and an 
n-tuple of objects a functionally defined object , and those which 
are not functionally defined we shall call ^_^T^lc_ohj0CtS ■ Simple 
objects have simple names and functionally defined objects have 
ciMfipoalte names . 

In addition to Linguistic forms associated with functions, 
there are linguistic forms associated with relations. For an n ar¬ 
gument relation there are n blanks in the linguistic form, Examples 
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of relational linguistic- forms arei " equals ", 

"__ J^ve _ to _ " and rr __ speaks". It is this 

aet of linguistic forms, corresponding to the re1stions in the model, 
that serve as frames for the kernel Sentences, 

In a manner similar to the way composite names are built, a 
kernel sentence corresponding to in elementary proposition is con¬ 
structed, by inserting names Corresponding to esch argument in the 
appropriate blank. Names may be simple or compos ite + An example of 
a kernel sentence for a proposition built from Such a relational 
linguistic form ii "John r s father gave .3 times the salary of Bill 
to Jack*" ■which contains the simple names "John", "Bill”, 

and "Jack", It contains the functional linguistic forme ”_ 1 £ 

father”, "_times _" and "salary of " and the rela^ 

tionsl linguistic form " __ gave __ to "i 

A kernel sentence corresponding to a complex, proposition 
is Constructed recursively from the kernel sentences corresponding 
to its elementary propositions! constituents by placing them in the 

Corresponding places In the linguistic forms " ____ and _", 

11 _ or _", "not _" etc. 

The kernel sentence Corresponding to a closed question is 
constructed from the kernel of the corresponding proposition by 

placing it in the linguistic form "Is it true that __7" Far 

an open question, dummy objects are placed In the open argument po¬ 
sitions to complete a propositional fonn. These dummy arguments 
have names "who", "what", "where", etc., and which dummy objects ore 
used depends on the Condition on that argument position. A question 
mark ie placed at the end of the kernel sentence constructed in 
the usual vay from the relational lingurscit form and the nemes of 
the arguments. 
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In generating a coherent discourse, a speaker chooses S num¬ 
ber of propositions in hi# model end/or seme open or closed ques¬ 
tions . He then uses linguistic information associated with the model 
to construct the set of keenel sentences corresponding to this sot of 
chosen propositions* In the neKt section we will discuss how he 
generates his discourse from this set of kernels. 

3) Ira ns forma tions on Kernel Sentences. The set of kernel 
sentences is the base of the coherent discourse* The meaning of s 
kernel sentence is the proposition into which It maps t and simi¬ 
larly., the meaning of any name is the object which is its Image un¬ 
der the mapping. To this set of kerneis wc apply a sequence of 
meaning preserving transformations to get the final discourse* We 
use the word "transformation" in its broad general Sense, Hot in 
the narrow technical sense defined by Chomsky (7), 

There are two distinct types of transformations* structural and 
definitional* ft 3 tr«ctoral or syntactic transformation is only de¬ 
pendent on the structure of the kernel fitring(a) on which it operates. 
For example, one syntactic transformation takce a kernel in the ac¬ 
tive voice to one in the passive voice. Another combines two sen¬ 
tences into a single complex coordinate sentence* 

■One large class of syntactic transforms!ions is used to sub¬ 
stitute pronominal phrases for names. Pronominal phrases may he 
ordinary pronouns such as 'lie"* "she 11 , or "it"* They moy be refer¬ 
ential phrase# such as "the latter 11 , "the former" or’this quantity"* 
They may also be truncations of a full name such as "the distance" 
for "the distance between Hew York and Los Angeles". In cases whore 
such pronominal reference is made, the Coherence of the final dis¬ 
course is dependent on the order in which the resultant strings 
appear. 
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thu Second type of transformation is definitional* It In¬ 
volves substitutions of linguistic fittings and In ms for ones ap¬ 
pearing in the kernel sentences* For example, tor any appearance of 
tinea >p uo may Substitute r, twiec'\ and for 11 ,5 times" substitute 
"one half of"* In addition to this string substitution, some trans- 
formations perform form substitution and rearrangement. For example, 
for e korneL sentence of the form M x is more than z" , where x, 
and i ate any names, one definitional transformation can substitute 
"x exceeds z by 

Some transformations are optional, and some tiey be mandatory 
££ certain forms arc present in the kernel set. Certain transforma¬ 
tions are used by a speaker for stylistic purposes, for example, 
to emphasize certain objects; other syntactic trsns fermutions such 
as those which perform pronominal substitutions art used because 
they decrease the depth of a construct ion, in the sense defined by 
Yhgye (44). 

Let us review the steps in the generation of a coherent 
discourse. The speaker chooses a set of propoe itions „ the ** ideas" 
he wishes to transmit, He then encodes them aa language strings called 
kernel sentences In the menner described above. He then chooses a 
sequence of structural and definitional trana formations which are 
defined on this set of kernels or on the ordered set of sentences 
which result from applications pf the first transformations* The 
resultins aequtnee of sentences will be a coherent discourse to a 
listener if he knows all the definitional transformations applied* 

In addition, for every pair of distinct names which the speaker maps 
back into the same object, the listener m^st also cap into a single 
object. 


In order to clarify thia theory, we show, in Appendix E, a 
sample semantic genorative grammar which will generate coherent dts- 
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course understandable by the STUDENT analysis program* The ob¬ 
ject-? are number* and the functions arc the arithmetic operation® 
of fining difference, product and quotients The only relation in 
the model is numerical equality. The tranefometions are described 
informallyj further 1inguletic investigation ia necessary before a 
formal notation for trsnflformations can be decided upon. Parallel 
to the gr arras i ia a sample problem generated by utilising, this gram¬ 
mar. Tbis problem la solvable by the 5XUDENT eystem. 


F. Analysis of Coherent Piscourse. 

Goneratton of coherent discourse consists of two distinguish¬ 
able step?-. From propositions in the speaker r £ oodel of the world, 
he generates an ordered set of kernel sentences. Ho then applies a 
sequence of tranaformations to this kernel set* The resulting dis¬ 
course iE a coded message which is to be analyzed and decoded by a 
listener. The Listener's problem can be loosely characterized as an 
attempt to answer the question, "What would I have meant if I said 
that?" 

Ita analyze a discourse the listener must find the set of ker¬ 
nel sentences from which it was generated; one way to da this is 
to find a set of inverse transformations which when applied to the 
input discourse yield a sequence of kernel sentences. The listener 
must then transform these kernel sentences to an appropriate rep¬ 
resentation In his information store- The appropriateness of 4 rep¬ 
resentation is a function of what later use the listener expects to 
isake of the information contained In the discourse. The listener 
may simultaneouslv transform o givan kernel sentence into a number 
of different representations in his information store. On a level 
of pragmatic analysis, statements require only storage of informstion. 
Questions and imperatives require sppropriete responses from the 


14 




listener. The d ifficult ies in analysis dichotomise Into those 
associated with finding the kernel sentences which ate the base of 
the discourse, and those associated with trans forming the kernel sen¬ 
tences into representatIons in the information store, 

Mathews {29} ha# suggested that analyst a can he performed by 
aynthea ia, A sequence of kernel sentences, and a sequence of trans¬ 
formations are chosen„ and the transformations are applied to the ker¬ 
nel sentences + The resulting discourse is matched- against the input, 
If they arc the some, these kernel sentence* and transformations give 
the required analysis of the input. If not, a change is made so that 
the resulting discourse becomes more like the input. 

If the kernel sentences and traps format ions were chosen, ran¬ 
domly , this method would obviously be too inefficient to work in 
any practical sense. However, by ytiltiling clues within the input 
discourse, the choice of kernela and tranSformat ions can be greatly 
restricted. This technique of sentence analysis is being implemented 
in s program being, written at KITHE by Walker and Bnrtlett (43). This 
technique has the advantage that enactly the same graimar can be 
utilised for both analysts and generation of discourse. 

A more direct analytical approach would utilize a set of in¬ 
verse analytic transformations, If T is a transformation that may 

be used in generating a diacousree, and T ( (S) * 5, where S and ~ are 

m 1 

sets of Sentences, then the analytic transformation T. ia the in- 

-1 — 1 

verse of T t if and only if T (S) = 5 , The choice of which In¬ 
verse transformations to apply and the order of their application 
,+,ay again be restricted by utilising heuristics concerned With 
features of the input. 

Once the base set of kernel sentence# for * given dis- 
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course is determined, there remains the problem of entering rep¬ 
resentations of these sentences in the listener's informat ton store- 
The major problem in accomplishing, this stop involves the separation 
of these wards which are part of linguistic forms for relations > and 
those which ere part of a name. This is difficult because the game 
word (lexicographic, aymbo 1) may have multiple uses in a language■ 
Having separated the relational form from the names which represent 
the arguments of this relation, one can then analyze the name in 
terms of components which are functional linguistic forms and others 
which arc simple names. From tliia parsing in terms of relational 
linguistic forms t functional linguistic forma and simple names, the 
discourse tan he transformed Into s canonical representation in the 
information store of the ] listener. 


C. Limited Deductive Models - 

A complete understanding of a discourse by a listener would 
imply that the see present at ion of the d i a Course in Me information 
store is essentially isomorphic to the speaker r 9 model of the world, 
at least for the universe of discourse- The listener's representa¬ 
tion must ptc9erve all information implicit in the discourse. 

If the listener Is only interested in certain aspects of the 
discourse, he need only preserve information relevant to his interest 
and discard the rest. Within his area of interest the listener's mod 
el 16 Isomorphic to the speaker's model in the sense that all rele¬ 
vant deductions which can be made by the speaker on the basis of the 
discourse can also be made by the listener- Outside this area of 
interest, Ehe listener will he unable to answer any questions- We 
CsLl such restricted information stores limited deduc tive models- 

The quest ion-answering programs of Lindsay and Raphael, end 
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Che STUDENT system, fill utilise limited deductive m^els, For the 
area of interest in each of these programs there was si ''natural" 
representation fur the infornnat ion in the allowable input. These 
representations were natural in that they facilitated the deduction 
of implicit information. Fur example, Lindsay*a family tree rep¬ 
resentation made it easy to compute the relationship of any two in¬ 
dividuals in the tree, independent of the number of sentences nec¬ 
essary to build the tree. 

Because the number of eolations and functions expressible 
in the models in all three ayst-ems is very limited* there ia a 
corresponding limitation on the number of linguistic forms that may 
appear in the input. This greatly simplifies tbo parsing problem 
discussed earlier t by restricting alternatives for words in the 
Input text. 


11 - The STUDENT Deduct ivo Model r 

The ETLIEEST system is an implementation of Che analytic por¬ 
tion yf our theory. STUTlETn performs certain inverse craneformat i ons- 
to obtain a set of kernel sentences and then transforms these kernel 
sentences- to expressions in a limited deductive model. Utilizing 
the power of this deductive model, within it? limited domain cf under¬ 
standing, it is able to answer questions based on information im¬ 
plicit in the input information, * 

The analytic and transformational techniques utilized in 
&TDHENT are described in detail in Chapter Jv* We shall describe 
here the canonical representation of objects, relations and func¬ 
tions within the medel. STUDENT is restricted to answering questions 
framed in the context of algebra skury problems. Algebraic equa¬ 
tions are e natural representation for information in the input. 
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The objects in the model are numbers, o* numbers with an as¬ 
sociated dimension. The only relation in the model is equality,,, aod 
the only functions represented directly in the model are the arith¬ 
metic operations of addition, negation, mult ip1ication, division 
end exponentiation. Other functions are defined in terms of these 
basic functions , by cosopostion, and/or substitution o£ constants 
£or arguments of these functions* For example, the operation of 
squaring is defined as exponentiation with ''2 11 as the second argu¬ 
ment of the exponential function; subtraction is a composition of 
addition and negation. 

Within the computer, a parenthesized prefix notation, is used 
for a standard representation of the equations implicit in the En¬ 
glish input, The arithmetic operation to be expressed is itade the 
first element of a list, and the arguments of the function are suc¬ 
ceeding list clenients. The exact nutation is given in figure 1 below* 


On c r a eIon 

Infix notation 

Prefix Notation 

Equals ty 

A ■ B 

(EQUAL A B) 

Addit1on 

i + E 

A 4- B + C 

(PLUS A B) 

(PIUS ABC) 

Uegation 

- A 

(MINUS A) 

Subtract ion 

A - B 

(PLUS A (MINUS B» 

Mult ip Heat ion 

A + E 

A * E * C 

(TIMES A B) 

(TIMES A B C) 

Division 

A / E 

(QUOTIENT A B> 

Exponentlacton 

a B 

(EKPT A E) 

Figure 2: Nutation 

Within the STUDENT 

Eeduc tlva Hod el 


In the figure. A* B t srnd G are any representations of objects in the 
model, either composite or a imp la nanw-S, Tbs usuai infix notation for 







these functional expressions is given for c reap a ris on. Kecays^ this 
is a fully parenthesized, notation, no ambiguity of operational order 
arises t as it does h for example, for the unparenthesijed infix nota¬ 
tion expression A*B#C or its corresponding natural language expres¬ 
sion ff A times B plus C". Mote also that in this prefix notation £lns 
and times are not strictly binary operators* indeed, in the model 
they nay have any finite number of arguments f e,g. {TIHKS A E C D) 
is a legitimate expression in the STUDENT model. 

Representations of objects in the STuDfchTT deductive model 
are taken from the input. Any string of words not containing a 
linguistic form associated with the arithmetic functions expressible 
in the model are considered simple names for objects.. Thus, 11 the age 
of the child of John and Jane 1 ’ fa considered a simple name because it 
contsine no functional linguistic ferm$ associated with functions rep¬ 
resented in STUDENT'S limited deductive model* In a more genera1 
model it would be Considered a composite name, and the functional 

forme "age of _" and "child of _ and _" would be 

mapped into their corresponding functions in the model. 

Because Such Complex strings are considered simple names in 
the mode1, and objects flre distinguished only by their names, it 
is important to determine when two distinct names actually refer to 
the same object* In fact, answers to questions in the STUDENT sys¬ 
tem are statements of the identity of the object referenced by two 
names. However, One of the names (the desired one) must satisfy 
certain lexical conditions. Must often this condition is just that 
the name be a numeral. For a more general model this restriction 
could he stated as requiring a simple name corresponding to some 
functionally defined name — because* for example, "number of ri 

would be a functional linguistic form in the general model, and the 
only simple name for such an object would be the nmral corres- 
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ponding to this number, An answer consists of a statement of 
identity e.g. ’"The number of customers Tom gets is 162." 

Hie other lexical restriction on answers sometimes used in 
the STUDENT system is insistence that a certain unit (corres¬ 
ponding to a dimension associated with a number) appear in the de¬ 
sired answer. For example, spans is the unit specified by the ques¬ 
tion "Elow many spans equals 1 fathom?", and the answer given by 
STUDENT is "I fathom is B spans". 


Hie deductive model described here is useful for answering 
questions because we know how to extract implicit information from 
expressions in this model; that is, we know how to solve sets of 
algebraic equations to find numerical values which Satisfy these 
equations. The solution process used in STUDENT is described In de¬ 
tail in Chapter VI. The transformation process, bused on the theory 
described earlier, which STUDENT uses to go from an English input 
to this deductive model, is described in Chapter IV, 
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CHAPTf ^- III; _ FHQC R AMMlNfl KQRKALISKS AND LANGUAGE WANIPULATION 


ALmoit any programing; language Is universal in the sense that 
with enough tine, sp ^, and work at the impLementation. any computable 
function may be programmed. However, Che task of programming can be 
“ Je much easier by the- peeper choice of a higher level problem ori¬ 
ented programing language. The data to be manipulated by the STU¬ 
DENT system is Symbolic, and of indefinite length and complexity. For 
this reason , a Ust-process ictg language w^s the rno*t appropriate type 
Of programming for this task. There art a number of such languages 
available, each having its own set of advantages and disadvantages, 

Fot 0 (iescr ipticn of the general properties of list-process ing lan- 
guages, with a detailed cotnparison of four of the better known list- 
proceeBing languages, see Bobrov and Raphael < 5 ). Mostly because I 
knew it so well , X those LISP (3-1) as the basic language for the STU¬ 
DENT System, 

The LISP formalism is very convenient for prograrmifng recursive 
tasks such as the solving of a set of simultoneoua equations. However, 
USF does not provide any natural mechanisms for representing manipula¬ 
tion of strings of EngLiah words, another very important suhtask in 
the STUDENT system. For this type of manipulation one would like to 
perform a sequence of steps involving operations such as recognising 
a sentence format which fits a particular pattern, finding certain ele¬ 
ments in a sentence by their contest, rearranging a string of worda. 
deleting, inserting, and duplicating parte P f strings, and others. 

The LISP forma Liam cannot easily express egch string manipula¬ 
tions, though each could be individually programmed. However, a for- 
nioliam foe just this sort of cianipulat ion is the basis of the COMIT (45) 
programming system. RuLea in this formalism can easily express very 




complex string, manipulations, and are easy to read and write. How¬ 
ever, CQKIT and LISP cannot be uSftdi e imuItsmeous l y, and the problem 
context necessitate! going beck and forth between LISP-or lented tasks 
and oatflT-or tented tseka. Therefor* , I adapted the OOktIT rule nota¬ 
tion for use in LI&F„ and constructed a LISE program called METEOR which 
wo«Ld interpret string transi format ion rulea in this notation. 

In constructing the METEOR interpreter, I effectively extended 
the eloquence of the LISE programming language; that ie, operations 
which could be done previously, but were awkward to invoke could now 
be expressed easily+ An extended Language embodying the beat features 
of OOMJT and LISP Could have been Wit from scratch, hut it is much 
more economical to achieve such extonsione by embedding. The advan¬ 
tage! and disadvantages of language extension by embedding are discussed 
in detail by Eobrow and Ueizenbaum (&). 


A. Specifying a .Desired String Format. 

METEOR has been described in detail elsewhere (3), but we in¬ 
clude here a brief suranary of its feature-e . We do this because use of 
the notation makes later explication of the transformation process 
esEier. In addition, If any ambiguity becomes apparent in the expla¬ 
nation bf the opera Hun of STUDENT, it may be resolved by consulting 
the listing of the STUDENT program In Appendix B + In this latter 
case, it "lay be necessary to consult the more complete specification 
of METEOR referenced above. 

A H&TEOh program consists of a sequence of rul*e each specifying 
g string transformation and giving, some centre 1 ! information. Lot uR 
first consider how a string transformsi-bd ia specified, We shall 
■call the string to be transformed the workspace ■ The workspace will 
be trains formed by a rule only if It metche& a pattern or forui'-Tt gWeu 
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in the ‘'Left half" of the This left half is e list of ele¬ 

mentary patterns which sped fits a sequence of items that must be 
matched in the workspace. For example, if the Left heIf were 
"(l^E BOY) 1 ' then s match won Id be found only If the workspace con¬ 
tained a "THE" immediately followed by "BOY ri - In addition to 
known constituents j, one can match unknown constituents + The ele¬ 
ment $1 in a left Half will match any one workspace Constituent. The 
left belf 11 (^ B $2 C)will match e cent Ignitus substring of the 

workspace which consists of an A followed by exactly one constituent 
(specified by th* marker "$ 1 ") followed by a & followed bv exactly 2 
constituents (notching, the "$Z") followed by an occurrence of a 
Thus $1 wttl mutch an element of the workspace with a specified Con¬ 
text - If a left, half would match more than one substring in the 
workspace, the Left-most such substring is the one found by the 
matching process. 

&fe Have discussed elementary patterns which match a fixed num¬ 
ber of unknown Constituents (e.g., 11 $3 tl matches J unknown constitu¬ 
ents). METEOR also he* an elementary pattern element "£ 1H which 
matches an arbitrary number of unknown constituents. For example, 
the left half (THE § TOY) will mstoh a substring of the workspace 
Which starts with an occurrence of "THE 11 followed by any number of cun 
atitucnt-S (including zero) followed by an occurrence of ,r BOY" . It 
would, for example, mutch a substring of the workspace "(GIVE THE 
GOOD BOY) 1 ’ or of the workspace "(THE BOY HERE)" . If the left 
half ($ GLITCH ?3) matches a substring of the workspace-, then the 
elementary pattern "S" catches the Substring from the beginning of 
the workspace up to but not including the first occurrence of "GLITCH 1 ' 
the pattern "'GLITCH 11 matches this occurrence of "GLITCH" in the work¬ 
space; and the elementary pattern "$3" matches the 3 elements or 
tonstituents of the workspace immediately following GUXTCK■ 


4 3 


Elements In the workspace may be tagged Of subaerlpted to in- 
dicste special properties of this element; for example, one might 
have {HAVE/VRR&) or (B0Y/N0L1N) as elements of the workspace. Such 
elements can be matched by name fusing HAVE Of BQY fis pattern elements), 
or identified j,uet by their subscripts {of by both). The elementary 
pattern ($1/VE,R.R) will match any single constituent which ie a verb; 
that is* one which has the subscript "VERB”, even if this constituent 
has other subscripts. Thus the left half (ALFHEb (31/YERB) BOOKS) 
will match the substring (ALFRED (REAPS/VHRB) BOOKS) in the work¬ 
space (NW ALFRED (READS/VERB) BOOKS IN TUB LIBRARY). 

Other elementary pattern dements are provided , end new pat¬ 
tern elements can be defined and cosily used within the HETEDE. system. 


B^ Specifying a Transformed Workspace, 

We have discussed how a desired format can be specified through 
1 prototype pattern, called a left half- If we try to match the work¬ 
space to a left half* but it is not In the format specified* we say 
the match has failed . If a substring of the workspace is in the speci¬ 
fied format, the match is successful. When there is a successfui 
match, we may wish to transform or manipulate the substring matched, 
or place in a temporary storage location, called a shelf, copies of 
segments of the matching substring. We shall now discuss the nota¬ 
tion used for specifying such t fansformat lone* and storage of material, 

A left half is a sequence i>f elementary patterns, and we associ¬ 
ate with each elementary pattern a number indicating its position in 
this left half sequence* for example* in the left half {$2 D $■ E) * 
the first elementary pattern. Si, would bo associated with the number 
1, the second, h* with i. $ with J, and I with 4. If a match is suc¬ 
cessful, each elementary pattern element in the left half matches a 
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pirn of Che substring of the workspace snatched by this left half. The 
part matched by a n elementary pattern can then be referenced by the 
number associated with this elementary pattern. For the left half 
given above, and the workspace (A B C D H E G) , the- left-half .natch 
succeeds, snd the substring (E C) may then he referenced with the num¬ 
ber 1, the substring (D) by 2, {£ A) by 3, and (E) by 4, 

The t rafts formed workspace is specified by the "right half r[ 
nf a HETEOK. role. This right hslf may be just the numeral 0, in 
which case the matched portion of the workspace is deleted. Other¬ 
wise this rijjht half must be a list of elements specifying a replace¬ 
ment for the matched substring. Any numbers in this right-haIf li*t 
reference (specify) the appropriate part of the matched substring. 

Other items in the list may reference themselves, or strings in tem¬ 
porary storage, or functions of any refertuneable substrings. In 
the example discussed above, if the right half were (3 2 M2H), then 
the matched portion of the workspace would bo replaced by (E A D M D H), 
and the workspace would become (AEADMlHG). Note that 1 and 4 
ware not mentioned in this right half and were therefore deleted from 
the workspace - Also 1 and 2 were in reverse order, and thus these 
referenced parts were inserted in the workspace in an order opposite 
to that m which they had appeared, 2 is referenced twice In this right 
half and therefore two copies of this referenced substring, "(D)" ap¬ 
pear in the workspace. The elements M and H in this right half refer¬ 
ence only themselves, and are therefore inserted directly into the 
workspace, 

Using the right-half elements described, that t* 3 numbers 
referencing matched substrings and constants (elements referencing 
themselves), one can express ironsformat Jons of the workspace tn 
which, elements have been added to, deleted from,duplicated in, and 
rearranged in the workspace, Elements to he added to the workspace 


thus far can only he constants , Let us consider some other possible 
right-half elements* They are all indicated by Hats which start with 
special flags. 

The contents of any shelf (temporary storage list) can be 
referenced by a two element list with first element either *A (for ALL) 
or *51 (for Next}, and a second oleirient, the shelf name. For example, 
(*A EQT) references the entire contents of a shelf named EQT- If th*s 
element appeared in ■* right half, the entire contents of that shelf 
would be placed in the corresponding place in the workspace* The 
first element of a shelf named SBtt£tKE$ could be put into the work¬ 
space by using the element (*~H SEETEM^KS) in a fight half* 

The flag FS as the first member of a list eervins a right, 
half element indicate# that the next member of this list is a function 
name* and the following ohcs are the arguments of thl# function* The 
wain* of the function for this set of arguments is placed in the 
workspace. In this way, any blSP function can be used within a HETEQK 

rule ■ 


The flag *K indicates that the rest of the list following is to 
be evaluated aa a right-beIf rule, and then is to be ^compressed" 
into a list which will be a single clement of the workspace* Thus* 
Chunks which are Longer, a Lid have more complex structure than a 
Single word can be treated as a single unit within the METEOR 
workspace string- The inverse operation is the expansion of a thunk 
so' Chat all its component# appear as individual constituents in the 
workspace. Expansion is indicated by a flag at the beginning of 
a right-half clement list. 

We have thus far discussed how the tr^nsformation of a string, 
called the workspace* can be expressed in terms of a left half which 
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is a pattern for a desired input format, find a right half which is a 
pattern for the desired output format-, There is no reason to limit to 
one the number of outputs from a single left half natch. In fact, a 
third sectlem of a METEOR rule, tailed the '"routing section" (for 
historical reasons), allows the programmer to give any number of oth¬ 
er right Halves, and place these referenced Lists at the beginning or 
end of any shelf {temporary storage list). The storage of such a 
"right half" is Indicated in the routing Section by a list starting 
with a *$ or a +Q, followed by the shelf name, and followed by a 
right half pattern,. The *£ indicate* that the referenced n»a t e r ia 1 is 
to be Stored on the beginning of the named shelf. *Q indicates that 
it should be Queued on the end of the shelf. Used with s -+N for re¬ 
trieval, a shelf built up by a *S is a pushdown list, (a last-in- 
first- out list), and a shelf built up by a *q is a queue (first-in- 
first-out list) , 

The only other Significant feature o£ a KE-TEOR program that we 
have not yet touchc-d on is the control structure in a set of rules - 
A METEOR rule has s name, and has a "go-to 11 section. Ordinarily, if 
the left-half match fails, control is automatically passed tu the 
next rule In sequence. If the left-half match succeeds, the right half 
and routing sections are interpreted, and then control is passed to 
the rule named in thf "go-Co". However, by insertion of a "**’■ 
immediately after the ruie name in the rule, the method of transfer of 
control is switched, and only on left-half failure will control pass 
to the rule named in the "go-to". 

Routing Control can also be changed by a list of the form 
"(*D nanel na»e2) rr in the routing section of o rule. After this list 
is interpreted, any occurrence of name! in t "go-to" will b* inter¬ 
preted as n "go-to" containing "nsmeZ". This latter feature alW 
easy return from: subroutines - The use of left-half success or failure 
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Ps a switch fpr tli* transfer of control makes it possible to write sig- 
nlfLednt one rule loops. 

A METEOR program is a sequence (list) of rules, Each rule is 
a list of up to six. elements. The following is Ein example of a METEOR 
rule containing all six elements: 

(NAME * {$ BOY) (2 1) ( / (*S £1 1 1} (*D PI F2)) FL) 

Wc shall briefly review the function of e-adh of these six elements. 

Tbfi first element of a METlXJK role is a name, and must be present 
in any rule* If no name is needed, the dummy name can be used. 

The second element Is a "**" and is optional. When it is present it 
reverses the switch oia flow of control, and transfer of control to the 
ryie named in the "go-to 1 ' is made on Left-ha If failure . 

The third element is mandatory, and is a left-half pattern 
which is to be matched in the workspace. The fourth element is 
opIionaI, and is a right-haif pattern specifying the result in the 
workspace of the string transformation desired. The fifth {optional) 
element la called the routing section, and is a list flagged with 
a ’7" as a first element. The remainder of the touting Section 1? a 
sequence of lists which specify operations which place items On 
shelves or L "xu-ti." values . The final cIckiuiI ni I U"l : lie "go¬ 
to” and specifies where control is to be passed if a match succeeds 
(in the normal case). A in this position specifics the neyt rule 
In sequence. 


C« Summary* 

In this chapter*. we have briefly suHiflarised tSie f(a[ui , c* of a 
language for string manipulation which has been embedded (by building 





the M.KTEOR interpreter) in the general list-processing Language LISP. 
The ability to describe easily in METEOR the string transformations 
needed to process English sentences, and also use, ufc*re appropriate, 
the functional notation of the general last-processing Language, LISF, 
was a grfcdt advantage In the programming effort Involved in this studv 

AS a final illustration of the power of the combined METEOR-LISP 
language, we include a program for Wang's algorithm for proving 
theorem* in the propositional calculus. This algorithm is described 
on pages 44-45 of the LISP manual (31). and a LISP program for the al¬ 
gorithm appears on pages 4E-50. Figure 3 below Contains the complete 
METEOR program far the algorithm, including definitions of four 
small auxiliary t,I$F functiona used within the METEOR program. 


In addition the figure contains a trace of the program it 
provea the theorem given after the first line containing ”(THIOREM) H1 . 
The ether lines give the theorems that are proven by the algorithm as 
steps in the proof of this theorem. This METEOR program compares 
quite favorably in both size and underataudibility to the one given in 
the LIS? manual, and to the one COHIT program which I hove seen which 
performs the Wang algorithm. 


U1 

3= 


cf 


rt 



_■ 

O* 




CP 





bl 






iyl 



■bjH 




*r 


JZ 

«!-■ 



j— i 








p™i 

j— u 



> 

i*% 

cd 



cf 

a 



H 

0*1 

■i 

,r» ■-■ 

u 


cf 



13 

Cf 

c- 

«#«** A 

MO- 




Cf 


■fc-r 

“ *»-r* 

ft 


CL 



l^i 

Ql 

h 

~y j«i ,1—i. 

3 

Q 




u 


tn 

— W 







> 

iU 

1- K 


4 




n 

IV 


w a; 


■u^ 




Kf 

3 

■wr 

m < 

H « 


Cf 




Q P-. 

r-s < a 

c 

L'T> 

Ul 

.—. 


a. 

LU 

Z X 

>QfJ 

6 

•uJ 

!W 

<? 




o o 

< <( 





a 


CJ CJ 

ku c_j 

fej 

_J 

n 

fl_ 


IT 


u *1J- 

i_r “_r *w 

d 


Cf 



< 

□f 

- n - 

|« M 


E 


> 

j—■ 

s—r 


z ^ 

C3 W W 

’W 


c_ 


E3 - 


a. 

es 

z- 

O —i «J 

O 

•mJ 

O 

■Cf 

LL. 

UJ 

o 


LI--J- **- 

l 


•UJ 


z 

srt t£ 

1_B 

o 

< 

%J!« 

> 

—1 

< 

3i ^ 

r^i 

X. 

c£ 

—H 



G_ 

-_- 

^ -l 

M 

H 

a 


3 


X 

a 


r«i >r Bu. ■gu> 


■c 

ifl 


Qf pBu 

1*^ — 

■* “ 

.— ■ 



L-J 

C£ 

r 1 

-UJ “J 1 

£L_ w 


ef 

e. 


■cr 

W 


vOf 

i_r IlLT 

Vi vjw 


r-i X ■£ -t 
W Q □ 
m Eli El 

*1^ 


I o ■ 


rJIflQ 
■-- 7= 

O IH 
h-LDG 

ac ec a: 

■rC -X *1 


llJ 

B ** 

UJ LL 

X -Zt 


p 

s< 

J=« 


o' 


! " “i 


j 

a 

oc 

oe 

L‘n_f 

a 

O li. 
L'J 

P< 


S s 

5 S 

!“-i cr ' 

! rta cr 

i 

iiiji 


s5 

AC 

l£ 

< 


S :, -. 

u_ r> 


■£ I- i— 


A 

£ 

i_r 

r*r 

£ m 
uJ 

a; S 

SS 
£ * 


> 

n=> -™ 

* S' 

O L_i 

Of ^ 
ad 

■C CD 

i S ?r- 

£ A £ 


tf 

» 


It 


5 


n± 

3 

lL 

S 


□; E is , 1 _. 

^ “j, j-f 

rj t=l ft S(,iJ U A 

3 " C I 3 SI -J ■_ 

Jrr -_J I— Lh_ fc-> Pi, F 

Sj. v WW ■—r i_r -i_*-W W ’—■ 




9 


¥ 

LlJ 

j" 

n 

fi T 


Hi 

£ 

-C 

£ 

V 


% 

jL, pi rv, li 

y S £ £ £ 


sc 

-t * 

E: 


w<* Z.* 

2 rr,S 

‘=* 


E£ £ 

w ^ K 


ip 

r—r u- 

f* U 

<1 

csr 

u 

ys 

? 

o 

3 


A 

H»l -1_- 

e* . 

Cf 

n_j> 




St 

CP 

s 

lid 

3 

a 

=T r_a 


£ 

V 

<%j 


"■■ ^ * K ■-■■ **3 C3 

ES 5 p,£~«3 

a^s 

'£ *s v ssM^ 

*_u» 2 »CBjk 

m .-. i r - i — =T « it-h- i"i 


irt-e^.w— H! 


.^1 


3 C c 

s 

~Si£ i 

= 63“ a 

LJ . n — 
&! 

1^1 
“ u u u 


E-sp^ v |i 

*5^13 


sjsi 

vNwTJI 


’'P 2 a e — £ *■ i"* 

uaSu^to F *? 
LnZu 


«-5SE SSESE-gs^J^'Kjs:^ 

"* « s| :t!:j=s! ! {Jririf 

t BE ._ 5 iA M 


r r t lx! ■*—* I 

k 5 


■HJ 

vr w 

Bg* 

„ 

l*J i Ij f < 

□ V w 


UJ 

Sh 


3 3 


H> 

03 


Figure 3: A METFOJt Progr.-im Car the Whtir A 1 gor lehat 


50 













CHAPTER IV: THASiab-tlHHATlQN OF ENGLISH IQ THE STUDENT DEDpCrEVH MODEL 


The STUDENT system eonsfits o£ Cwc main subprograms t called 
STUDENT and PBHWm. The program called 5EHEHBIER accepts end pro¬ 
cesses statements which contain global information; ttiAt ls, in¬ 
formation wtsich is not specific to any one story problem* We shall 
discuss the processing and Information storage techniques used 
in FlEMEMSEIK in the next chapter* A listing of the global informs- 
tion given to the STUDENT system may be found in Appendix C. 

In this chapter, we shall describe the techniques embedded in 
the STUDENT program which are used to tIranifora an English statement 
of an algebra story problem to expressions iTl che STUDENT deductive 
model* By implication we are also defining the subset of English 
which is "understood 1 " by the STUDENT program, A more explicit des¬ 
cription of this input language is given at the end of the chapter. 


A. Outline of the Operation of STUDENT* 

to provide perspective by which to view the detailed heuristic 
techniques used in the STUDENT program, we shall first give an out¬ 
line of the operation of the STUDENT program when given a problem to 
solve. This outline is a verbal description of the flew chart of 
the program found In AppendI k A. 

STUDENT is asked to solve a particular problem* Wc assume that 
all necessary global information has been scored previon-sly, STUDENT 
will now transform the English input statement of this problem into 
expressions in its limited deductive model, and through Appropriate 
deductive procedures attempt to find a solution. More specifically, 
STUDENT finds the kernel sentences of the input discourse, and trans- 
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forms tht-S sequence of kernels into a set of simuItareouS equations, 
keeping, a list of the answers required, a list of the units involved 
in the problem (e,g, dollars h pounds) and a Lift o£ all the variables 
(simple names) in the equations-. Then STUDENT invokes the SOiiVK program 
to solve this sot of equations for the desired unknowns* If a solu¬ 
tion is found, STUDENT prints the values of the unknowns requested in 
a fixed format,, substituting in " (variable IS va Iuc )” the appropriate 
phrases for variable and value . If a solution cannot ho found, 
various heuristics are used to identify two variables (1,0. find two 
slightly different ptitaicS that refer to the same object in the mode 1). 

If two variables, A and B, are identified, the equation A ■ E is added 
to the set of equations. In addition, the store of global informat ion 
is searched to find any Equations that may be useful in finding the Solu¬ 
tion to this problem, STUDEKT prints: out any assumptions it makes about 
the identity of two variables, a^d also any equations that it retrieves 
because it thinks they may be relevant. If the use of global equa¬ 
tions or equations from identifications lends to a solution, the an¬ 
swers are printed out in the format described above ■ 

If a solution wa# not found, and certain idioms -are present in 
the problem (a result of a definitional transformation used in the 
generation of the problem), a substitution is made for each of these 
idloma in turn and the transformation and solution process is re¬ 
peated. If the substitutions for these idioms do not enable the prob¬ 
lem to be solved by STUDENT, then STUDENT request# additional Informa¬ 
tion from the questioner, showing him the variable# being used in the 
problem. If any information is given, STUDENT trios to solve the prob¬ 
lem again. If none l& given, it reports its inability to solve this 
problem and terminates. If the problem is ever solved, the solution 
is printed and the program terminate®-. 
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B- Cii^EKOiies of Words in s Trans forma t lsn. 

The words and phrase b (atrings of words) in the English input 
C:an be classified into three distinct categories on the basis of how 
they dre handled in the transformation to the deductive model. The 
Hirst category consists of strings of words which name objects in the 
model; I call such strings, var iaMes * Variables are identified only 
by the string of words in them* and if two strings differ at allj they 
define distinct variables- One important problem considered below 
is hew to determlne when two distinct variebles refer to the s a me o b= 
j e c t. 

The second class of words and phrases are what 1 call " p substitu¬ 
tors". Each subsk ttufor may be replaced by another string,* Some sub¬ 
stitutions are mandatory; others are optional and are only made if the 
problem cannot be solved without such substitutions. An example of 
a mandatory substitution is "2 times 11 for the word ’’twice"* "Twite" 
always means "2 times" in the contest of the model, and therefore this 
substitution is Tiumd a to r y, One optional "idiomatic"' substitution is 
"twite the sum of the length and width of the rectangle" for "the peri¬ 
meter of the rectangle". The use of these Substitutions in the trans¬ 
formation process is discussed below. These Substitutions are inverses 
of definitional transformations aa defined in Chapter II* 

Members of the third class of words indicate the presence of 
functional linguistic forms which represent functions in the deductive 
model. I call members of this third class "operEitors 11 . Operators 
may indicate operations which are complex combinations of the basic 
functions of the deductive model. One simple operator is the word 
"plus", which indicates that the objects named by the two variables 
surrounding it are to be added. An example of a more complex operator 
is the phrase "percent less than", .-ss in "10 percent less than the 
marked price", which indicates that the number imucdlately preceding 
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the "percent” is to be subtracted from 100 , this result divided by 100, 
and then this quotient mult IpLied by the variable following the "than"* 

Operator* may be classified according to where their arguments 

are found► A prefix operator, such as "the square of." precedes 

its argument, An operator like percent 1 ' £,* a suffix operator, 

and follows its argument. Infix operators such as 11 „ . . , ,p luS ... . .” 

or 11 .less than.,... 11 appear between their two arguments. In s 

split prefix operator such as "difference between,... .and 
part of the operator precedes, and part appears between the two 

arguments . "The SLUT: of . . ,. . and ,** + + snd." is a split prefix 

operator with an indefinite number of arguments. 

Some words may act as operator® conditionally, depending on 
their context. For example, "of" Is equivalent to "times" if there 
is a fraction icniediately preceding itj e,g. , "-5 of the profit" is 
equivalent to ",5 times the profit"} however, "Queen of England” 
does not imply a mult Sp Heative relationship between the Queen and 
her country, 

C, Tr a ns f o r:.ia £ iona 1 Fr ooed u-rx-g „ 

l*et us now consider in detail the transformation procedure used 
by STUDENT, and see how these different categories of phrase® Interact. 
To make the process more concrete, let us consider the following example 
which has been solved by PTUtENt. 


(THE PROBLEM TO BE SOLVED IS) 

(IF THE HI MB EE. OF CUSTOMERS TOM GETS JS TWICE THE SQUARE OF 
20 PER CEHT OF THE NUMBER OF ADVERTISEMEKTS EE RUNS, AWE IffE 
HlMBER OF ADVERTISEMENTS Ilf RirNB IS 45 * WHAT IS THE HUMBER 
OF CUSTOMERS TOM GETS Q.) 
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Shown below ere copies of actual printout from the STUDENT pro- 

■ 

grnm, illustrating stages in the transformation and the solution of the 
problem* The parentheses are an artifact of the LISP programming lan¬ 
guage,. and "Qis a replacement for the question mark not av*E h-hle 
on the hey punch. 

The first stage in the transformation is to perform alt winda- 
Cory substitutions. In this problem only the three phrases underlined 
(by the author* not the program) are substltutor s; "'twite" becomes 
"2 times", "per cent 11 becomes the single word "percent"* and ^square 
of" is truncated to "square"* Having made these e ubs t it ut ion-S, STUDENT 
prints: 


(WITH MANDATORY SUBSTITUTIONS THE Problem Is) 

(IF THE NUMBER OF CUSTOMERS TOM GETS IS j_ TIMES THE SQUARE 
20 PERCENT OF THE MEMBER OF ADVERTXSEMEHTS HE RUNS, AND THE 
NUMBER OF ADVERTISEMENTS HE RUNS IS 45, WHAT IS THE NUMBER 
OF CUSTOMERS TOM GETS Q.) 


From dictionary entries for each word, the words in the problem 
Sre tagged by their function in terms of the transformation process* 
and STUDENT prints: 


(WITH WORDS TAGGED BY FUNCTION THE PROBLEM 1$) 

(IF THE NUMBER (OF / OF) CUSTOMERS TOM (GETS / VERB) IS 
2 (TIMES /OP 1) THE (SQUARE /OP 1) 20 (PERCENT / OP 2)(OF/OF) 
THE NUMBER (OF / OF) ADVERTISEMENTS (HE / PRO) HUNS, AND THE 
NUMBER (OF / OP) ADVERTISEMENTS (HE / PHD) RUNS IS 45, 

(WHAT / QWORD) IS THE NUMBER (OF / OP) CUSTOMERS 
tOM (GETS / VERB) (QMARX / DLM)) 
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If a word has a tag„ or tags, the word followed by "/"i followed by 
the tags* becomes a single unit, and is enclosed in parentheses + Some 
typical taggings are shown above. "(OF/OP)” indicates Chat M OF" is 
an operator and other taggings show that "GETS” is a verb, rr TlMES ,r 
is an operator of level I (operator levels will he explained below)* 
"SQUARE” is an operator of level 1, "PERCEHF r is an -operator of level 
2, ,r HE” is a pronoun* ’'WHAT” is a question word, and "QMAEtK" (replac¬ 
ing Q*) is o delimiter of a sentence. These tagged words will pl-iy 
the principal role in the remaining transformation to the set of 
equations implicit in this problem statement. 

The next stage in the transformation is to break the input sen¬ 
tences into "kernel sentence* tr , As in the example, a problem may 
be stated using Sentences of great graimatical complexity; however, 
the final stage of the transformation la only defined on a set of 
kernel sentences. Tine simplification to kernel sentences as done in 
STUDEKT depends on the recursive use of format matching. If an in¬ 
put sentence is of the form 11 IF 1 ' followed by a substring, followed by 
a comma* a question word £ti« 3 o second substring (i.e. it Hatches the 
METEOR left half "(rF $ , ($1/ QWQEIK $) rr ) then the first substring 
(between the IF and the comma) is made art independent sentence, and 
everything following the Comma is made irttb a second sentence. In 
the example* this means that the input is resolved into the fol¬ 
lowing two sentences, (where tags arc emitted for the sake of brevity^ 

"The number of customers Tom gets is 1 Limes the 
square 20 percent of the number of advertiintntS 
he runs* end the number of advertisements he runs 
Is and ,F Whst is the number of customers Tom gets?” 


This last procedure effectively resolves a problem into declara¬ 
tive assumptions end 3 quC-Stlon sentence. A second complexity resolved 
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by STUDENT is illustrated in the first sentence of thU pair. A co¬ 
ordinate Sentence cons rating of two sentences Joined by a comma itn- 
medlately followed by an "and" (i.e. 3 any aentente matching the 
METEOR left half "(.$ * AND $) PF ) will be re solved into these two -In- 1 
dependent sentences. The first sentence above is therefore resolved 
int 0 two simpler sentences* 

Using these two inverse syntactic transformations ,. this prob¬ 
lem Statement is resolved Into "simple kernel -Sentences, for the 
l;x amp 1 e , STUDENT prints 


(THE SIMPLE SENTENCES ARE) 

(THE NUKHJiR (OF/OP) CUSTOMERS TOM (CUTS / VERB) IS 
2 (TIMES /OS i) THE (SQUARE / OP 1) 20 (PERCENT / OP 2) 

(OF / OP) THE NUMBER (OF / OP) ADVERTISEMENTS (HE / FRO) 
RUNS (PERIOD / DLM» 

(THE NUMBER (OF / OP) ADVERTISEMENTS (HE / PRO) RUNS IS 45 
(PERIOD / DLK)) 

((WHAT / QUO RIP) IS THE NUMBER (OP / OP) CUSTOMERS TOM 
(GETS / VERB) (QHARK / DIM)) 

Each simple sentence la a separate list, t.e,„ is enclosed in paren¬ 
theses , and each ends with *5 delimiter (a period or q^itlon mark). 
Each oE these sentences idn now be trana formed directly to its infer- 
ptetAtion in the model. 
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H. From Kernel Sentences to Equations a 

The trail*formation froti the simple kernel sentences to equa¬ 
tions uses three levels of precedence for operators. Operators of 
higher precedence level ere used earlier in the trensformation* Be¬ 
fore utilizing, the operators,. STUDENT looks for linguistic forms 
associated with the equality relation* These fame Include the eopuU 
"is" and transitive verba in certain content*. In the example we ere 
considering* only the copula "ia" is used to Indicate equality. The 
use of transitive verba aa indicators of equalityf that Is, as rela¬ 
tional linguistic forms„ will he discussed in connection with another 
example. Vhen the re1at ionsL Linguistic form is Identified* the 
names which are the arguments of the form are broken down into 
variables and operator* (functional linguistic form*). In the present 
problenu the two name* are those on either side of the "ie" in each 
sentence■ 

The word "is" may also bo used meaningfully within algebra 
story problems os an auxiliary verb (not meaning equality) in such 
verbal phrases as "is multiplied by 1 ' or ''is divided by". A special 
check is made for the occurrence of these phrase* before proceeding 
on to the main transformation procedure. The trSn*formation of sen¬ 
tences containing these special verbal phrases will he discussed later. 
If "is” doc* not appear as aft auxiliary in such a verbal phrase * a 
sentence of the form "Pi la f2" ia interpreted as indicating the 
equality of the objects named by phrases El and p2, Mo equality 
relation will be recognized within these pbr**#( d even if an appro¬ 
priate transitive verb occurs within either of them. If PL* and 
P2* represent the arithmetic transformations of Pi and P2, then "Fl 
is Pi" Ls transformed Into the equation 

"(EQUAL PI* P2*}". 
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Ttn" transformation of P1 And P2 to give them 51 n interpretation 
in the model it performed recursively using, a program equivalent to 
the table in Figure 4. Tliis table shows all the operators and for¬ 
mats Current Ly recognized by the STUDENT program. New operators can 
easily bo added to the program equivalent of this table. 

In performing the transformation of a phrase P N s. Left to 
right search is made for an operator of level 2 (indicated by sub¬ 
scripts of "OP" and 2), If there is none, a left to right Search is 
made for a level l operator (indicated by subscripts n 0P ri and 1), 
and finally another left to right search is made for an operator of 
level 0 (indicated by a subscript "OF" and no numerical subscript). 

The firat operator found in this ordered search determines the first 
step in the transformation of the phrase* This operator and its con- 
text are transformed as indicated in Column 4 in the table. If no 
operator is present, delimiters and articles (a, an and the) are de¬ 
leted p and the- phrase Is treated as an indivisible entity, a variable. 

In the ex-amp 1 a, the first Simple sentence Is 

(THE NUMBER (OF/QP) CUSTOMER:; TCEM. (GETS/VERB) IS 

2 (TIMES/OP 1) THE (SQUARE/OF 1) 20 (PERCENT/OF 2) 
(OF/OF) THE NUMBER (OF/OF) ADVERTISEMENTS 
(HE/PRO) RUNS (PERK®Mil)) 

This is of the form "Pi is P2 m t and Is transformed to (EQUAL El* P2*J, 
Fl is "(THE NUMBER (OF/OF) CUSTOMERS TOM (GETS/VERg) }", The occur¬ 
rence of the verb "gets" is ignored because of the presence of the 
"is" in the sentence t meaning "equals", The only operator found 
is "(OF/OF)". From the table we see that if "OF" is immediately pre¬ 
ceded by a number (not the word "number") it is treated as if it 
were the infl* "TIMES"* In this ca»t, however, "OF" is not preceded 
by a number; the subscript OP, indicating that n QF" U an operator, Is 
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Operator 

Precedence 
Leva L 

ComtekL 

Interpretation in the Model 


PLUS 

2 

71 PLUS P? 

(run pi* f2*> 

c*> 

mss 

0 

71 7L1R? PZ 

(FL1I& FL* FiP> 

w 

HiSTIS- 

z 

pi minis p2 

(PEJUS PL* {KDNIB FI*)) 

<*} 



KlNfE Vi 

(OTTOS Pi*> 


HTKUS3 

i> 

71 HUniSS 72 

(fe.iis fi* (Kunre f2*}> 

(t) 

TIKES 

l 

71 TIMES Pi 

(TIKES PI* H*) 


DivaK 

i 

FI U1VRT pi 

(QUOTIENT 71* 72*) 


SQUARE 

i 

MIlLV. r:L PL 

(EXPT 71* i> 

w 

SQUARED 

0 

PI SQUARED- 

(EKP7 71* 2> 


7* 

0 

71 ** 72 

(Lit FI Fl* Pi*) 


LHSTMH 

2 

Pi c^eestkah pi 

(PUTS 7?* (MINUS 71*)) 


PER 

Q 

PI FES E re 

(fSWTOEST PI* (K F2)*) 

t*> 



PI TER pi 

(J^IOTIEIT Fl+ (1 F2)*) 


PE&CBfl 

z 

PI 7 PERCENT pi 

(Pi WIGO) Pi)* 

(E> 

PfiRiHS 

2 

Pi 7 FLRLEES Pi 

tPi<(iw-K>/iw> 72 )* 

<0 

ELK 

0 

SLTl 71 AND P2 AED T3 

(PLOS Pi* (EL*C F2 AND F3)p) 




:JIJH 71 AND ?2 

(FLOS PL* n*) 


J) It'PtHlfhCS 

Q 

LITFEREKCE BE.7VE2K 

PI AND- 72 

(FLOS PL* (HUiUfl pi*)) 


or 

a 

It (IF F2 

CtlMfcS K Pi*) 




PI UP PI 

(71 OF 72)* 



(jO if pi [h i p'hrjfni. PL* lndlonte-D Lt: interpretation]] in Che r»odel T 
<b) FLUES *ni MDOlSS are idem-Leal to PLUS AND HINH5 except tor precedence- Level, 
(e) Hhui two pa to lb La cor.taisti are Ind lexted. they cliackuii 1 b tti# a-rder shewn, 
(d) $ QUAKE Pi mnl SCTI 71 oi'ft UIhmCId s-horccr.Ln^tf of SQl'iiKlF- 07 Pi and 5LW -07 Pi¬ 
ty) * cut aide a parentheiiited ?«?*>** a i-o* lodloitcn tltac :nc ctic Loocd phr-sae id 

co t-r: tr ana fenced , 

{£) If la * runict- 

(g) f *r.d i- inpiy th»t She IndSeacHtt arithJBCtic opera ti-ona are actually peric-roed. 


Figure 4; Op-i-rdt or a fl.ee oa~H gccL by STIIT-Ml?!T 
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a tripped away, and the transformation process is repeated on the 
phrase with "OF** no longer acting a E gn operator* In this repeti¬ 
tion* no operators arc- found, and Fl* i 3 the variable 

(NUMBER OF CUSTOMERS TOM (GETS/VERB) ) * 

To the right of "IS" in the sentence ij p£: 

(2 (TIMES/OP 1) THE (SQUARE/0F 1) 20 (FEECEnT/of £) (OF/OP) 

THE NUMBER (OF/oP) ADVERTISEMENTS (HE/PRO) RUNS (PERIOD/DLM)) 

Tlie first operator found In F2 lb PERCENT, an operator of level 
2, Front the table in Figure 4, we aee that this operator has the effect 
ot dividing the number irmed1atc1y preceding it by 100. The "PERCENT" 
iB reevoved sckI the transformation is repeated on the remaining phrase - 
In the example, the "...20 {PERCENT/OF 2} (OF/OP)..." becomes 
"... . 2000(OF/OF) 

Continuing the transformation, the operators found are, In 
ordtft* TIMES, SQUARE, OF and OF. Each is handled as indicated in 
the table. The "OF 11 in the context "... ,2000 (of/OF) THE 

is treated as an Infix TIKES, while at the other occurrence of "OF", 
the Operator marking is removed. The resulting transformed expres¬ 
sion for F£ is: 

(TIME 2 (EXFT (TIKES .2 (NUMBER OF ADVERTISEMENTS 
(HE/PRO) RUNS)) 2)) 

The transformation of the second sentence of the example is 
done in p similar manner, and yields the equation: 

(EQUAL (NUMBER OF ADVERTISEMENTS (HE/PRO) RUNS) 45) 
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The third sentence is of the form 'Vhat is PL7". It starts with 
a question wotd and is therefore treated socially. A unique ¥iri*ble t 
a single word consisting of an X of G followed by five integers, 
is created, and the equation (EQUAL Xn.nn.nn Pi*} is stored, Far this 
example, the variable XQGQOI was created* and this last simple sen¬ 
tence is transformed to the equation: 


(EQUAL XOOOQl (NUMBER Cl F CUSTOMERS TOM (GETS/VERB)) 

In addition, the created variable ia placed on the list of variables 
for which STUUENT is to find a value. Also, this variable is stored, 
paired with FJL, the uncransfornied right side, for nsc in printing out 
the answer. If a value is found for this variable, STUDENT prints the 
sentence (PI is value ) with the appropriate substitution for va1ue - 
Below we show the full set of equations, and the printed solution given 
by STUDENT for the example being considered. For ease in solution, the 
last equations created are put first in the List of equations. 


(THE EQUATIONS TO BE SOLVED AJtE) 

(EQUAL X0OQQ1 (NUMBER OF CUSTOMERS tUM (GETS/VERT))) 

(EQUAL (NUMBER OF AOTEKTISBffiNTS (HE/PRO) RUNS) 45) 

(EQUAL (NUMBER OF CUSTOMERS TOM (GETS/VSRB)) (TIMES 2 (EXFT 
(TIMES ,2000 (NUMBER OF ADVERTISEMENTS (HE/PRO) RUNS)) 2))) 

(THE NUMBER OF CUSTOMERS TOM CRTS IS 162) 


In the example just shown, the equality relation was indicated by the 
copula M is lf . In the problem shown below, solved by STUDENT, equality 
is indicated by the occurrence of a transitive verb in the proper context. 
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(THE PROBLEM ID EE SOLVED IS) 

(TOM HAS TWICE AS MANY FXSH AS MARY ilAB GUFF1ES* IF MARY HAS 
3 GUPPIES-j, WHAT IS THE NUMBER OF FIS|I TOM HAS q + ) 

(THE EQUATIONS TO EE SOLVED ARE) 

(EQUAL K00Q01 {NUMBER OF FTSH TOM (HAS/VERB))) 

(EQUAL (NUMBER OF GUPPIES (MARY/PERSON) (HA5/VERB)) 3) 

(EQUAL (NUMBER OF FISH TOM (HAS/VERB)) (TIMES 2 (NUMBER 0? 
GUPPIES (MARY/PERSON) (HASA r HfiB))}) 

(THE NUMBER OF FISH TOM HAS 1$ 6) 

The verb in this case is "has", The simple Sentence 'Mary has 3 
guppies" is transformed to the "eqg.lv a lent” sentence "The number t>f 
guppies Mary has is 3" and the processing of this latter sentence is 
done os previously discussed. 

The general format for this type of sentence and the format 
of the Intermediate sentence to which it is transformed is best ex¬ 
pressed by the following METEOR rule; 

i* (4(Sl/muO($L/HlRBER) $) (THE NUMBER OF 4 1 2 Is 3) *) 

This rule may be read: anything {a Subject) followed by a verb fol- 
lowed by a number followed by anything (the unit) is transformed to 
a sentence starting with "THE HUMBER 0F ,p followed by the unit, fol¬ 
lowed by the subject And the verb., followed by "IS" and then the 
number. In "Mary he# 3 guppies'" the subject is "Mary", the verb "has", 
and the units ''guppies"* Similarly, the sentence '"Che witches of 


Firth brew 3 magic potiona" would be transformed to 


"The number of magic potions the uit'ches of Firth brew is 3 

In iitlil i t i po to >3 declaration of number, a a ingle-object tran¬ 
sitive verb Tik'iy be used in a comparative structure* such as exhibited 
in the sentence "Turn has twite as many fish as Mary has gcippies." 

The METBGFL rule which gives the effective trace forma t ion for this 
type of sentence structure is: 

{* {$ {$ 1/VEEIB) $ A& MAM $ AS ? ($1/VEHB> $) 

{THE NUMBER OF 6 1 2 E$ 3 THE WlXrfMlEIC OF 10 S 9) *) 

For the example, the transformed sentence is: 

"The number of fish Tom has Is twice the number of guppies 

Mery has" 

Transformation of new sentence formats to formats previously 
"understood" by the program con be easily added Co the program* thus 
extending the subset of English "'understood'" by STUDENT. To the pro¬ 
cessing that actually takes place within STUDENT the intermediate 
Sentences shown never exist. It was easier to go directly to the 
node 1 from the format, utilising subroutines previously defined in 
terms <j£ the semantics of the model. 

The word "is" indicates equality only if it is nut used as 
an auxiliary. Tlia example below shows how verbal, phrases containing 
"is", such as "is multiplied by", and "is increased by" are handled 
in the traneformation. 


(THE PROBLEM TO BE SOLVED IS) 

(A NLKBER LS MULTIPLIED BY 6. IS PRODUCT IS INCREASED BY 44. 

THIS RESULT IS 68 . FIND THE NUMBER .) 

(TUB EQUATIONS TO lit SOLVED ARE) 

(EQUAL XOOOOl (NUMBER)) 

(EQUAL (PLUS (TIMES (HUMBER) 6) 44) 68) 

(THE HUMBER IS 4) 

The sentence "A number is multiplied by 6 11 only indicates that 
two objects in the model are related multiplieatively, end does not 
Lndicnte enplititly any equality relation. The interpretation of 
this sentence in the model is the prefix notation product: 

(TIMES (NUMBER) b) 

This letter phrase is stored £n a temporary location for possible 
Iriter reference. In this problem, it is referenced in the next sen¬ 
tence, with the phrase "THIS PRODUCT*', The Important wvird in this last 
phrase is "THIR" — STUDENT ignores all other words in a variable con¬ 
taining the key word "THIS", Die last temporarily stored phrase is 
substituted for the phrase containing "THIE”. Thus, the first three 
Sentences in the problem shown above yield only one equation, after 
two substitutions for "this" phrases. The last sentence ’’Find the 
number," ie transformed as if it wore "Uhac is the number Q, 11 . 
and yields the first equation shewn. 

The word "this" may occur In a context whore it is not 
referring to a previously stored phrase. Bolow is an example of 
such 0 context. 


(THE PROBLEW TO BE SOLVED IS) 

(THE FRIGE OF A RADIO IS 69.70 DOLLARS . IF THIS PRICE IS 

15 PERCENT LESS TRAN THE WBKED PRICE , FIND THE MAR1SD PRICE -) 

(THE EQUATIONS TO BE SOLVED ARE) 

(EQUAL K00001 (MARKED PRICE)) 

(EQUAL (PRICE OF RADIO) (TIMES + &499 (MARKED PRICE))) 

(EQUAL (PRICE OF RADIO)(TIMES 69*70 (DOLLARS))) 

(WE MARKED PRICE IS 82 DOLLARS) 

In such contexts, the phrase containing; ,p DiIS M Is replaced by the left 
half of the last equation created> In this example, STUDENT breaks 
the last sentence into two simple sentences, deleting the "IF"- Then 
tha phrase "THIS PRICE' 1 Is replaced by the variable "PRICE OF RADIO 1 ', 
which is the left half of the previous equation. 

This problem illustrates two other features of the STUDENT pro¬ 
gram, The first is the action of the complex operator "percent less 
than". It causes the number immediately preceding it,, i+e+* 15, 
to be subtracted from 100, this result divided by 100, to give +85 
(printed as +8499 due to a rounding error in floating point conversion). 
Then this operator becomes the infix, operator "TIMES" + This is in¬ 
dicated in the table in Figure 4 + 

This problem also illustrates how units such as "dollars" are 
handled by the STUDE3TT program. Any word which i Timed Lately follows a 
number is labeled as a special type of variable called a unit. A 
number followed by a unit la treated in the equation se a product of 
the number and. the unit, e .g. , ‘^9. 70 DOLLARS" becomes "(TIMES 
69+70 (DOLLARS))". Units are treated as special variables in solving 
the; set of equations, a unit may appear in the answer though other 
variables cannot, If the value for a variable found by the solver is 



the product of e number and a unit, STUDENT concatenites the number 
and the unit. For example, the solution for "(MARKED FRICE)" in 
the problem above was (TIMES 82 (DOLLARS)) and STUDENT printed out: 

(TUL MARKED PRICE IS 82 DOLLARS) 

Iticrc is an exception to the fact that any unit may appear in 
the answer, as illustrated In the problem below. 

(THE PROBLEM TO BE SOLVED IS) 

(IF 1 SPAN EQUALS 9 INCHES, AHO 1 FATHOM EQUALS b FEET, 

HOW MANY SPANS EQUALS 1 FATHOM Q-> 

(THE EQUATIONS TO BE SOLVED ARE) 

(Equal xooooi (times i(fathoms))) 

(EQUAL (TIMES 1 (FATHOMS)) (TWES & (FEET)}} 

(EQUAL (TTMES 1 (SFANS)) (TIMES 9 (INCHES)}} 

THE EQUATIONS WERE INSUFFICIENT TO FIND A SOLUTION 

(us ino me following known relationships) 

((EQUAL (TIMES 1 (YARDS)) (TIMES 3 (FEET)}} (EQUAL (TIMES 1 

(FEET)) (TIMES 13 (INCHES)))) 

(1 FATHOM IS 8 SFANS) 

If the unit of the answer is specified, in this problem by the phrase 
"how many spans " — then only th."«t unit, in this problem "spans 11 , 

Tiiay appear ip, the answer. Without this restriction, STUDENT would 
blithely answer this problem with "( 1 FATHOM IS 1 FATHOM)". 

In the transFormat ton from the English statement of the problem 
to the equation* s "9 INCHES" became (TIMES 9 (INCHES)), However, 
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"1 FATHOM" became "(TIMES I (FATHOMS}) 11 . The plural form for fathom 
has been used instead of the singular form, STtfDNKT always use# the 
plural form if known, to ensure that all units appear in only one 
form, Since "fathom" and "fathoms 1 ' are different, if both were used 
STUDENT would treat them as distinct, unrelated units. The plural 
form is part of the global information that tan be made available 
to STUDENT, and the plural form of a word is subetiLuted for any 
Singular form appearing after "1" in any phrase. The inverse opera¬ 
tion la carried out for correct printout of the solution. 

Notice that the information given in the problem was insufficient 
to allow solution o£ the set oE equations to be solved. Therefore, 
SrUJlKWr looked in its glossary Cot information concerning each of the 
units in this set of equations# It found the relationship* "1 foot 
equals 12 inches/ 1, and " F 1 yard equals 3 feet." Using only the first 
fact] and the equation it implies* STUDENT is then able to solve the 
problem# Thue, in certain crises where a problem is not a ns Ly t ic 3 
in the aenae that it does not contain, explicitly stated, ail the 
information needed for Its solution, STUDOT ia able to draw on a 
body of facts* picking out relevant ones, and. use them to obtain a 
solution, 

In certain problems, the transformation process dues not yield 
a set of solvable equations. However, within this set uf equations 
there exists a pair of variables (or more th;m one pair) such that 
the two variables are only "slightly different 11 , and really name the 
same object in the mpd«l. When a set of equations ie unaoLvable, 
STUDENT searches for relevant global equations# in addition, it 
uses several heuristic techniques for identiCylng two "slightly 
different 11 variables in the equations. The problem below illustrates 
the identification of two variables where in one viiri.tbtc a pronoun 
has been substituted Eor a noun phrase in the other variable- 111 is 



identification, is made by checking ell variables appearing before one 
containing the pronoun, and finding one which is Identical to this 
pronoun phrase, with e substitutton of a string oi any length for 
the pronoun. 


(THE PROBLEM TO BE SOLVED IS) 

(THE NUMBER OF SOLDIERS THE RUSSIANS HAVE I£ ONE HALF OF THE 
NUMBEF OF CUNS THEY HAVE * THE NUMBER OF GUNS THEY HAVE IS 
7000 . Uffiir IS THE MEMBER OF SOLDIERS THEY HAVE Q.) 


(THE EQUATIONS TO BE SOLVED ARE) 

(EQUAL KQ0QQ1 (HUMBER OF SOLDIERS (THEY/PRO) (HAYE/VERB))) 

(EQUAL (HUMBER OF GUNS(THEY/FRO) (HAVE/VERB)) 7000) 

(EQUAL (HUMBER OF SOLDIERS RUSSIANS (HAVE/VERB)) (TIMES ,5000 
(HUMBER OF GUNS (THEY/PRO) (HAVE/VERB)))) 


THE EQUATIONS WERE INSUFFICIENT TO FIND A SOLUTION 


(ASSUMING THAT) 

((NUMBER Of SOLDIERS (THEY/FRO) (HAVEA T ERB)) IS EQUAL TO 
(NUMBER OF SOLDIERS RUSSIANS (HAVE/VERB))) 


(THE NUMBER OF SOLDIERS THEY HAVE IS 3500) 


If two variables match in this fashion, STUDENT assumes the two 
variables are equal * printe out a statement of this assumption, as 
shown, and adds a cl equation exp r**£ i n g this equality to the set 
to he salved. The solution procedure is tried again, with this 
additional equation* In the example, the add5e ibn* 1 equation was 
sufficient to allow determination of the solution. 
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TW eicam-ple below is again ? "non-analytic" problem* The first 
Set ci£ equations developed by STUDENT i-s unsolvablct TUorefore. 
STUDENT tries to find some relevant equations in its sstofc of ®lo- 
biil information.. 


{THE PROBLEM TO EE SOLVED IS) 

(THE GAS CONSUMPTION Of MY CAR IS 15 MILES PER GALLON r 
THE DISTANCE BETWEEN BOSTON AND NEW YORK IS 250 MILES. 
WKAT IS THE NUMBER OF GALLONS OF GAS USED ON A TRIP 
BETWEEN NEW YORK AND BOSTON Q.) 


(THE EQUATIONS TO BE SOLVED ARE) 

(EQUAL K00001 (HUMBER OF GALLOWS OF GAS USED ON TRIP 
BETWEEN NEW YORK AND BOSTON)) 

(EQUAL (DISTANCE BETWEEN BOSTON AND NEW YORK) (TIMES 
250 (MILES))) 

(EQUAL (GAS CONSUMPTION OF MY CAR) (QUOTIENT (TIMES 
15(MILES)) {TIMES 1 (GALLONS)))) 


THE EQUATIONS WERE INSUFFICIENT TO FIND A SOLUTION 


(USING THE FOLLOWING KNOWN RELATIONSHIPS) 

((EQUAL (DISTANCE) (TIMES (SPEED) (TIME))) (EQUAL (DISTANCE) 
(TIMES (CAS CONSUMPTION) (NUMBER OF GALLONS OP CAS USED)))) 

(ASSWING TJlAT) 

((DISTANCE) IS EQUAL TO (DISTANCE BETWEEN BOSTON AND NEW 
YORK)) 

(ASSUMING THAT) 

((CAS CONSUMPTION) IS EQUAL TO (GAS CONSUMPTION OF MY CAR)) 
(ASSUMING THAT) 

((NUMBER OF GALLONS OF GAS USED) IS EQUAL TO (NUMBER OF 
GALLONS OF GAS USED ON TRIP BETWEEN NEW YORK AND BOSTON)) 


(THU NUMBER OF GALLONS OF GAS USED ON A TRIP BETWEEN 
NEW YORK AND BOSTON IS !&*&& GALLONS) 


It uses the first word of each variable string os o Key to it© 


JO 


glossary. The one exception to this rule is that the words ri number 
of" ere ignored if they ore the first two word* of a variable string. 
Thus , in this problem, STUDENT retrieved equations which were stored 
under the key word* distance, gallons , gas , and miles . Two facts 
about distanee had been stored earlier; "distance equals speed times 
time” and "distance equals gas consumption times number of gallons 
of gas used". The equations implicit in these sentences were stored 
and retrieved now — as possibly useful for the solution of this 
problem. In fact, only the second is relevant. 

Before any attempt is made to solve this augmented set of 
equations, the variables in the augmented set are matched, to Identi¬ 
fy "slightly different" variables which refer to the same object in 
the model. In this example "(DISTANCE.)" s "(GA& OCK&UMPIION) 11 and 
"(NUMBER £JF CALLOWS OF CAS USED) 11 ;, an; all Id cull f i ed with "similar" 
variables. The following conditions must be satis-fted for this type 
of identification of variables fi and F2; 

1) PI must appear leter in the problem than P2, 

2) PI is completely contained in ?2 in the Sense that Fl 
is a contiguous substring within P2. 

This identification reflects a syntactic phenomenon where a 
truncated phrase, with one or more modifying phrases dropped, is 
often used in place of the original phrase. For example d if the phrase 
“the length of a rectangle" has occurred, the phrase "the length" 
may be used to mean the same thing. This typo of identification is 
distinct from that made using pronoun substitution. 

In the example shove, a stored Schema was used by identifying 
the variables in the schema with the variables that occur In the prob¬ 
lem, This problem is solvable because the key phrases "distance", 

Vgas consumption" find "number of gallons of gas used” occur as 
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Substrings of the variables in the problem. Since STUD-ENT tdcnt l- 
fics each generic key phrase of the schema with a particuUr vari¬ 
able of the problem, any s chema can be used only once la a problem* 
Because STUDENT handles schema in this jnJ hoc fashion it cannot 
solve problems in which e relationship such as "distance equals 
speed times time 1 ' is needed for two different values of distance, 
speed, and time. 


E. Possible Idiomatic Substitutions. 

There are some phrases which have a dual character, depending 
on the content. In the example below* the phrase "perimeter of a 
rectangle" becomes a variable with no reference to its meaning, or 
definition, in terras of the length and width of the rectangle. 

This definition is unneeded for solution. 


(THE PROBLEM TO EE SOLVED IS) 

(THE SUM OF THE PERIMETER OF A RECTANGLE AND THE PERIMETER 
OF A TRIANGLE IS 24 INCHES, IF THE PERIMETER OF THE RECTANGLE 
IS TWICE THE PERIMETER OF THE TRIANGLE, WHAT IS THE 
PERIMETER OF THE TRIANGLE Q.) 


(THE EQUATIONS TO BE SOLVED ARE) 

(EQUAL JCOOOQ1 (PERIMETER OF TRIANGLE)) 

(EQUAL (PERIMETER OF RECTANGLE) (TIMES 2 (FEHIMKTKR OF 
TRIANGLE))) 

(EQUAL (PLUS (PERIMETER OF RECTANGLE) (PERIMETER OF TRIANGLE)) 
(TIMES 24 (INCHES))) 


(USE PERIMETER OF THE TRIANGLE IS 3 INCHES-) 


However, the following problem 1* stated in terms of the peri¬ 
meter, length and width of the rectangle. Transforming the English info 


72 




(THE PROBLEM TO- BL SOLVED I S) 

(THE LENGTH OF A RECTAUGLE IS- £ INCHES M/QRE THAN THE WIDTH 
OF THE RECTANGLE . ONE HALF OF THE PERIMETER OF THE RECTANGLE 
IS 1H INCHES , F]NP THE LENGTH AND THE W ! I)TM OF THE RECTANGLE 

H J 


(THE EQUATIONS TO RE SOLVED ARE} 

{EQUAL GASS1& (K0|i OtH. OP RECTANGLE}} 

(EQUAL G0ZS15 (LENGTH)3 

(EQUAL (TIMES + S0GQ {PERIMETER OF RECTANGLE)) { T JME-S l& (INCHES))) 

(EQ.QAL (LENGTH QF RECTANGLE) (PLUS (TIMES 6 (INCHES)} (WIDTH 
OF RECTANGLE}}) 


THE EQUATIONS HERE INSUFFIflCat TO FI NO A SOLUTION 

(USING THE FOLLOWING KNOWN RELATIONSHIPS} 

((EQUAL (T I H £$ 1 (F£ET)3 (TIMES II (INCHES})}} 

(ASSUMING THAT) 

((LENGTH) IS EQUAL TO (LENGTH QF SECTANGLE)) 

THE EQUATIONS HERE INSUFFICIENT TO FIND A SOLUTION 


tfcvjhltl POSSIBLE I PI QMS 

(THE PROBLEM NlTH AN I (HflMATI G SU0STUTI Q\ IS) 

(THE LENGTH OF A RECTANGLE IS U INCHES WORE THAN THE WIDTH 
OF THE RECTANGLE . ONE HALF QF TWICE THE SUM OF THE LENGTH 
ANO WIDTH QF THE RECTANGLE IS IS INCHES . f INP THE LENGTH ANO 
THE WIDTH QF THE RECTANGLE .} 


(the equations tq se solved aaej 

(EQUAL G02S1S (WIDTH QF RECTANGLE)) 

(EQUAL GQ251 J (LENGTH)} 

(EQUAL [TIMES (TIMES ,S0(ll) 2) (PLUS (LENGTH) (hIOTH OF RECTANGLE)}) 
(TIMES IB (INCHES}}} 

(EQUAL (LENGTH QF RECTANGLE) (PLUS (TIMES £ (INCHES)} [WIDTH 
QF RECTANGLE})) 

the equations were insuffsciedt to find a solution 

(USING THE FOLLOWING KNOWN RELATIONSHIPS) 

(C EQUAL (TIMES 1 (FEET)) (TIMES 1Z (INCHES}})} 

(ASSUMING TMAT) 

CCLENGTH) IS EQUAL TO (LENGTH! QF RECTANGLE)) 

(THE LCHGTH IS 13 INCHES) 

(THE WIDTH QF THE RECTANGLE IS 5 INCHES) 
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equations is not sufficient for solution, Neither retrieving and us¬ 
ing an equation about "inches 11 , the unit in the problem, nor identi¬ 
fying "length" with a longer phrase serve to make the problem sol¬ 
vable, Therefore* STUDENT looks in its dictionary of possible idioms, 
end finds one which it ran try in the problem* STUDENT actually 
had two possible idiomatic substitutions which it could have made 
for "perimeter of a rectangle"; one was in terms of the length and 
width of the rectangle and the other was in terms of the shortest and 
longest sides of the rectangle. When there arc two possible $u'b£t(tu~ 
tions for a given phrase, one is tried fifat » namely the one STUDENT 
has been told shout most recently* In this problem, Che correct one 
was fortunately first. If the other had been first > the revised 
problem would not have been any more solvable than the original, 

And eventually the second (correct) substitution would have 
been made. Only cue non-mandatory Idiomatic substitution is ever 
made at sne time, although the substitution is made for all occur¬ 
rences of the phrase chosen- 

In this problem* the idiomatic substitution made allows the 
problem to be solved* after identification of the variables ■"length 1 " 
snd "length of rectangle" . The retrieved equation about inches was 
not needed * However , its presence in the set of Equations to he 
solved did not sidetrack the solver in any way. 

This use of possible, but non-mandatory idiomatic substitutions 
can also be used to give STUDENT a way to solve problems in which two 
phrases denoting one particular variable are quite different. For 
example, the phrase, "students who passed the admissions test" snd 
"successful candidates" might be describing the same set of people. 
However, since STUDENT Jcnows nothing of the ""real world" and its 
value system, for success, it would never identify these two phrases. 
However, if told tint "success ful candidate-* 11 acotetime means "students 


who passed the admissions teEt", it would be able to solve a problem 
using thene two phrases to identify the same variable. Thus, pos¬ 
sible idJemotic substitutions serve the dual purple of providing ten¬ 
tative substitutions of definitions, and Identification of synoLio- 
m.o u s phrases. 


F. Special Heuristics. 

The methods thus far discussed have been applicable to the 
entire range of algebra problems„ However t for special classes of 
problems,, additional heuristics may be used which are needed for 
members of the tl-iss, but not applicable to other problems , An 
example is the class of age problems t as typified by the problem 
be low t 

(THE FROBLEM TD EE SOLVES IS) 

(BILL S FATHER S UNCLE IS TWICE AS OLD AS BILL $ FATHER,. 2 
YEARS FROM MOW BILL £ FATHER WILL IE i TIMES AS OLD AS BILL, 
THE SUM OF THEIR AGES IS 92 . FIND BILL S AGE .) 

(THE EQUATIONS TO BE SOLVED AEE) 

(EQUAL XQ0QQ1 ((BILL / PERSON) S AGE)) 

(EQUAL (PLUS ((BILL / PERSON ) S (FATHER / PERSON} & (UNCLE 
/ PERSON) S AGE) (1T,U5 {(BILL / PERSON) S (FATHER / PERSON) 

S AGE) ((BILL / PERSON) S AGE)» 92) 

(EQUAL (PUIS ((BELL / PERSON) S (FATHER / PERSON) S AGE) 2) 
(TIMES i (PLUS ((BILL / PERSON) S AGE))) 

(BILL S AGE IS S) 

Before the Eige problem heuristics are used* a problem must be 
identified as belonging to that class of problfcms* STUDENT identifies 
age problems by any occurrence of one of the following phrases t "a* old 
as", "years old" and ’’age". This identification is made iumediately 
after all words are looted up in the dictionary and tagged by function. 
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After the special heuristiee are need the modified problem ie trans¬ 
formed to equations as described previously. 

The need for special mothode for as# problems arises because 
of the conventions used fof denoting the variables, all of which are 
agea. The word age is usually nnt used explicitly, but is implicit 
in such phrases a£ "as old j$"* people 1 s names are used where their 
ages are really the implicit variables. In. the example, for instance, 
the phrsse "'Bill’s father 1 * unde" is used instead of the phrase 
"'Bill's father's uncle's nge" - 

STUDENT uses a spec list heuristic Co make all these ages ex- 
plicit- To do this, it must know which words are "person words" and 
therefore, may be associated with an age* For this problem STUDENT 
has been told that Bill , father, and unde are person words. They 
can be Seen tagged as such in the equations* The " " following a 
word is Che STUDENT representation for po3 S-eiS ive, used instead of 
"apostrophe - s" for programming convenience► STUDENT inserts a 
"S AGE" after every person word not followed by a "S" (because this 
"a" indicates that the person word is being used in a possessive 
sense, not 49 an independent age variable). Thus, as indicated, 
the phrase "BILL S FATHER S UNCLE" become* "BILL S FATHER S UNCLE S 
AGE 1 '. 


In addition to changing phroses naming people to ones naming 
ageE, STUDENT makes certain special idiomatic substitutions. For 
the phrase "their ageE 11 , STUDENT substitutes a conjunction of all 
the age variables encountered in the problem* In the example, for 
"THEIR AGES" STUI)PIT substitutes "BILL S FATHER $ UNCLE S AGE AND 
BILE 5 FATHER S AGE AND BILL 5- ACE", The phrases "as old ns" and 
"years old" are then deleted as dummy phroses not having any meaning, 
and “will be" and "w*9" ate changed to "Is"* There Is no need to 
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preserve Che tense of the copula, since Che Ser e of the future or 
past t e-n.ee is preserved in such prefix phrases as "2 years from now" , 
cr n 3 year* ago". 

Tlie rtma j. n i ng specie L age problem heuristics are used to process 
the phrases "in % years", "'5 years ago" end "nw", The phrase "2 
years from now" ie transformed to “in 2 years' 1 before processing. 

These three time phrases may occur immediately after the word "age 1 ", 
(e.g.. "fill's a^c 3 years ago") or at the beginning of the sentence. 
If a time phrase occurs at the beginning of the Sentence, it implic¬ 
itly modifies all ages mentioned in the sentence, ckcc £i those 
followed by their own t icne phrase, For example, "in 2 years Bill’s 
father's ago will be 3 times Bill's age'" Is equivalent to "Bill's 
father's age in 2 years will be 3 times Bill's age in 2 years". How- 
ever, "3 years ago Mary's age was 2 times Ann's age now" is equivalent 
to Gary's ago 3 years ago was 2 times Ann 1 ! ege now". Thus prefix 
time phr-'iSos Are handled by distributing them over all ages not 
mudifled by another time phrase. 

After these prefix phrases have been distributed, each time 
phrase is translated approprlately* The phrase "in 5 years'' causes 
5 to be added to the age it follows* And 11 7 years ago" causes 7 
to be subtracted from the age preceding this phra se. The viin] "now" 
is deleted. 

Only the special heuristics described thus far were necessary to 
solve the first age problem* The second age problem, givon below, 
requires one additional heuristic not previously mentioned. This 
is a substitution for the phrase "wos when" which effectively de¬ 
couples the two facts couMned in the first sentence. Fop "wAs 
when 1 ” f STUDENT substitutes "was K year® + K years ago" where 

K is a new variable created for this purpose* 
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(THE PROBLEM TO EE SOLVED 15) 

(MAHY IS WICE AS OLD AS ANN WAS WHEH HAKY WAS AS OLD AS ANN 
IS mw , IF MARY IS 24 YEARS OLD, HOW OLD IS ANN Q.) 

(THE EQUATIONS TO BE SOLVED ARE) 

(EQUAL XMOOS ((ANN / PERSON) S AGE)) 

(EQUAL (( MARY / PERSON) S AGE ) 24) 

(EQUAL (PLUS ((MARY / PERSON) S ACE) (MINUS (KQQOQ7))) ((ANN 
/ PERSON) £ AGE)) 

(EQUAL ((MARY / PERSON) S AGE) (TIHES 2 (PLUS ((ANN / PERSON) 
£ AGE) (MINUS (S&0007))))) 

(ANN £ AGE IS IS) 

In the eseampLe* the first sentence becomes the two sentences: 

"Mary is twice as old ss Ann XDOOD7 years ago* SD0007 years age 
Hary was ss old as Ann is now, 11 These two occurrences of time 
phrases are handled as discussed previously, Similarly the phreee 
"will be when" would be transformed to "in K rears , In K years". 

These decoupling, heuristics are useful not only for the STUDENT 
program but for people trying to solve age problems. The classic age 
problem about Hary and Aon* given above, took an MIT graduate student 
over 5 minutes to solve because he did not know this heuristic, with 
the heuristic he was able to set up the appropriate equations much 
more rapidly. As a crude measure of $TUDENT f e relative speed, note 
that STtfDDJT took less than one minute to solve this problem. 


G, When All Else Fails , 

For a LI the problems discussed thus far, STUDENT was able to 
find a solution eventually. In some cases, however* necessary glo¬ 
bal information is missing from its store of information, or vari¬ 
ables which name the same object cannot be identified by the lieuris- 
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Clcs of the program. Whenever STUDENT cannot find a solution for any 
reason, it turns to the quest latter far ht*lp+ As in the problem 
below, it prints out "(DO YOU KNOW ANY WORE RELATIONSHIPS BETWEEN 
THESE VARIABLES)" followed by a list uf the variables in the problem. 
The questioner can answer N ycs ri or "no"* If be says "yes", 

STUDENT says "TELL ME"* and the questioner can append another sen¬ 
tence to the statement of cbe problem. 


(THE PROBLEM TO BE SOLVED IS) 

(THE GROSS WEIGHT OF A SHIP IS 20000 TONS . IF ITS NET 
WEIGHT IS 15O0Q TONS , WHAT IS THE WEIGHT OP THE SHIPS 
CARGO Q.) 

IKE EQUATIONS WERE INSUFFICIENT TO FIND A SOLUTION 


TRYING POSSIBLE IDIOMS 


(BO YOU ENOW ANY MORE RELATIONSHIPS AMONG THESE VARIABLES) 
(GROSS WEIGHT OF SKIP) 

(TONS) 

(ITS NET WEIGHT) 

(WEIGHT OF SHIPS CARGO) 


yes 

’ TELL ME 

(the -weight of s ships cargo ie the; d i f ference between 
the gross weight and the net weight) 

THE EQUATIONS WERE INSUFFICIENT TO FIND A SOLUTION 
(ASSUMING THAT) 

((NET WEIGHT) IS EQUAL TO (ITS NET WEIGHT)) 

(ASSUMING THAT) 

((CROSS WEIGHT) IS EQUAL TO (CROSS WEIGHT OF SHIP)) 
(THE WEIGHT OF THE SHIPS CARGO IS 5000 TONS) 
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In this problem, the additional Information typed J-r (in Lower 
case letters) was sufficient to solve the problem r If it was not, 
the question would be repeated until the questioner said "no", or 
provides sufficient information for solution of the problem. 

In the problem below, the solution to the set of equa¬ 
tions involves solving a quadratic equation, which, is beyond the 
mathematical ability of the present STUDENT system. Note that in 
this case STUDENT reports that the equations were unaolvable, nut 
itinpljf insufficient for solution- STUDENT still requests additional 
information from the questioner. In the example, the questioner says 
"tu, 11 , and STUDENT state* that "I CANT SOLVE THIS PROBLEM" and terminates. 

(THE PROBLEM TO EE SOLVED IS) 

(THE SQUARE OF THE DIFFERENCE BETWEEN TOE NUMBLK OF 
APPLES AND THE NUMBER OF ORANGES ON THE TABLE IS EQUAL 
TO D „ IF THE NUMBER OF AFFLE5 IS J , FIND THE NUMBER 
OF ORANGES ON THE TABLE .) 

(THE EQUATIONS If) BE SOLVED ARE) 

(EQUAL GQ2515 (NUMBER OF ORANGES Of: TABLE)) 

(EQUAL (NUMBER OF APPLES) 7) 

(KQ1.JAL (EXFT (PLUS (NIJHBHt OF APPLES) (MINUS (NUMBER 
OF ORANGES ON TABLE))) 2) 9) 

UNABLE TO SOLVE Tilts BET OF EQUATIONS 
TRY TNG POSSIBLE IDIOMS 

(DO YOU KNOW AMY MORE RELATIONSHIPS AMONG THESE 
VARIABLES) 

(HUMBER OF AFFLES) 

(HUMBER OF ORANGES ON TABLE) 
no 

E CANT SOLVE THIS PROBLEM 


SO 


H. St linn-ary a £ the STUDENT Subset of English. 

The subset of F'in.g 1 i e h unders tsixlable by STUDKJJT is built 
acoyflJ .1 core of sentence and phrase formats, which can be transformed 
into expressions in the STUDENT deductive model. Cm this basic 
Core is buiLt a larger set of formats* Each of these are first trans¬ 
formed into a string built on forgets in this basic set and then this 
String is transformed into an expression in the deductive model. For 
example, the format ($ 15 EQUAL TO ?) is changed to the baaie for¬ 
mat (5 IS $), and the phrase "15 CONSECUTIVE To" Is changed to 
"IS 1 PLUS’" >■ The constructions discussed earlier involving single 
object transitive verbs could have been handled this way, though 
for programming Convenience they were not. 

The complete list of the basic format a accepted by the present 
STUDENT system can be determined, by examining (in the program list¬ 
ing in the Appendix) the rules from the one labeled OPFQRK to the one 
Labeled QSET* The METEOR rules of the STUTENT 1 program precisely 
specify tb* acceptable forioats, and their translations to thv nwdel* 
but I shall try to summarise the basic and extended formats here. 
Implicitly assumed in the syntax is that any operator appears onl-. 
within one of the contexts Specified in the table given in Chapter II„ 
and only the operators given in the table appear. The listing of 
STUDENT Starting at the rule labeled IDIOMS gives tcans Litions of 
additional operators to those in the table. 

The basic linguistic form which is transformed into an 
equation is one containing "is" as a copula, The phrases H i* equal 
to" and "equals" are both changed to the copula "is". The 
auxiliary verbal constructions "is multiplied by", "is divided by" 
and "is increased by" are also acceptable as principal verbs in a 
Sentence, As discussed in detail earlier, a sentence with no 
occurrence of "is" can have as a main verb a transitive verb insnedi- 




scely followed by a number. This number must be a* element of the 
phrase which is the direct object of the verb, 35 in ">1ary has 
three guppies*'. This type of transitive verb can also have a compara¬ 
tive structure as direct object, €.g.., n Hary ha a twice as many 
guppies as Tom has fish" ■ 

This completes the repertoire of declarative sentence formats. 
Any number of declarative sentences may he conjoined, with "Hand" 
between each pair, to form a new (complex) declarative sentence,. 

A declarative sentence (even a complex: declarative) can be made 
a presupposition for a question by preceding It with 'TF h and fol¬ 
lowing it with a ccoria and the question. 

Question?,, that is, requests for information from STUDENT,. will 
be understood if they match any of the pstternSf 


(HEAT IS $) 


(WHAT AJtE 5 AND 


(FIND $)■ 

(HOW MANY $ TONS $ HAVE) 


(FIND 5 AND 

(HOW MANY $ DD $ HAVE) 

(HOW MANY $1 IS $) 


Tills completes the summary of the set of input formats present¬ 
ly understood by STUDENT. This Set can be enlarged in two distinct 
ways. One is to enlarge the set of basic formats, using standard 
subroutine? to aid in defining, for each new basic format, its inter-* 
pretation in the deductive model. The other method of cutending the 
range of STUDENT input is to define transformations from new input 
formats to previously understood basic or extension formats. In the 
next chapter we discuss how thi? latter type of extension can be 
performed at run time, using the STUDENT global information storage 
facility,. A combination of English and HETEOH elementary pattern 


elements can be need to define the input format and transformation* 


Even if j story probloro, is stated within ftic Subset of English 
acceptable to 5TUPENT , this is not a guarantee that this problem can 
be solved by STUDENT (assuming it to be solvable)* Two phrases des¬ 
cribing the object must be *t worst only "slightly different 1 ' by 
the criteria prescribed earlier* Appropriate global Information 
must be available to STUDENT. and the algebra involved must not ex¬ 
ceed the abilities of the solver. However , though most algebra story 
problems found in the standard texts cannot ho Solved by STUDENT exact 
as written the author has usually been able to find some paraphrase 
0 -f almost all such problems, which is solvable by STUDENT- Appendix D 
contains a fair sample of the range of problems that can bo handled 
by the STUDENT system. 


I. Limitations of cm: -STUDENT Subset of English. 

The techniques presented in this chapter are general and tan 
be used to enable a computer program to accept and understand a 
fairly extensive subset of English for a fixed semantic base. How¬ 
ever,, the current STUDENT System is experimental and has a number of 
1 imitations. 

STUDENT'S interpretation of the input is based on format 
matching. If each format is used to express the moaning understood 
by .STUDENT j no misinterpretation will occur. However * these formats 
occur in English discourse even in algebrs story problems, in semantic 
Contexts not consistent with STUDENT 1 # interpretation of these for¬ 
mats, For example, a sentence matching the format "{$ j AMD $}" 
is always interpreted by STUDENT as the conjunction of two declarative 
statement#■ Therefore, the sentence "Tom has Z apples, 3 bananas h and 
£i peara l" would! be incorrectly divided into the Luo "sentences"' 



Ton has 2 appIcS, 3 bananas." 


a ad "4 pears. 1 ' 1 


Each of the optima In r words shown in Figure 4 must be used as 
an operator in the context as shown or a miainterpretation will 
result* For example, the phrase "the number of times I vent to 
the movies" which should be interpreted as 0 variable String will be 
interpreted incorrectly as the product of the two variables "number of' 
and "I went to the movies”* because "times 11 is always considered to 
be an operator. Similarly t in the currant implementation of STUDENT, 
"of" is considered to be an operator if ft is preceded by any number* 
however, the phrase "£. of the boys who passed" will be misinterpreted 
as the product of and "the boys who passed". 

These examples obviously <io riot constitute a complete list of 
misinterpretations and errors STUPES T will make,, but it should give 
the reader an idea of limitations on the STUDENT subset of English, 
frt principle, all of these restriction? could be removed. However, 
removing some of then would require only minor changes to the program, 
while others would require techniques not used in the current 
s ys t em, 

For example, to correct the error in interpreting "2 of the 
boys who passed", onO can simply cheek to see if the number before the 
"of" is leas than 1, and if so, only then interpret "of" as sn 
operator "times"* However, a much more sophisticated grCvcnar and 
parsing program would be necessary to distinguish different occur¬ 
rences of the format "(? » APP $)", and correctly extract simpler sen¬ 
tences from complex coordinate and subordinate sentences. 

Because of limitations of the sort described above, and the 
fact that the STUDENT syst<vn currently occupies almost all of the 
computer memory, STUDENT serves principally as a demonstration of 
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Che of the technique* uCilize-d in its construction, Kowev 

I believe that on a larger computer one could use tfreac techniqu 
ti> tonstruet a system of practical value which would comnunicat* 
well with people In English over Che limited ratine of material 
understood by the program* 


CHAPTER V- STimAGE OF GLOBAL INFORMATION 


This algebra probLem-soLv1ng system contains two programs 
which process English input. One is the problem thus far discussed, 
STKIiENT, which accepts the statement of an algebra story problem and 
attempts to find the solution to the particular problem. STUDENT docs 
not store any information, nor "remember 1 ' anything from problem to 
problem. The in format ion obtained by &TTJDEHT is the local context 
of the question. 

The other program is colled REMEMPER and it processes and stores 
facta not specific to any one problem-. These facts make up STUDENT e 
a tore of "global information 1 " os opposed to "local information" 
specific to the problem. This information is accepted tn a subset -of 
English which overlaps but is different from the subset of English 
accepted by ST'JPEE'JT■ 5EMEMBER accepts statements in certain fined 
formatSj and for e-fleh format the information is stored in a way that 
makes it convenient for retrieval and use within the STUDENT program. 
Some information is -Stored by actually adding WKTEOR rules to the 
STUUKHT program, ;J tid other Information is stored on property lists 
of Individual words, which are unique atoms in the LISP system* 

The following are the formats currently understood by RDIOlSEIt, 
and the processing and information storage techniques uaed for 
e a ch one: 

1. Format: Pl EQUALS P2 

Example: DISTANCE EQUALS SFEED TIKES TIME 

Processingi The sentence is transformed into an equation in 
the seme way it is done in STUDENT. This equation is stored on the 
property lists of the atoms which are the first words in cacti 





variabLe. In the GitSunpltj the equation 


"(EQUAL (BI5IAHCE) (TIMES (SFEED) (TlMR))) rl 

1h stored on the property list* of "DISTANCE", "SPEED" end "TIME". 

If any one of these words appear* aa the initial word of a variable 
in a problem, and global equaCions are needed to aolve this problem, 
this•equation will be retCloved- 

2. Format; PI IS AM OPERATOR OF LEVEL S 

Example: TIMES IS AN OPERATOR OF LEVEL t 

Processing: A dictionary entry fot FI is created, with sub¬ 
scripts of OP and K. For TIMES, the dictionary entry (TIMES / (>P 1) 
is created , The dictionary entry for any word is plated on the 
property ] 1st of that word (atom), and is retrieved a^d ofl^d in 
place of any occurrence of that word in a problem, 

i* Format t Fl IS AN OPERATOR 
Example: OF IE AN OfEKATCHt 

Processing: A dictionary entry is created for PI with the Sub¬ 
script OF- The entry for OF la (OF/OF). 

4 , Format ; Fl IS A P2 
Example: BILL IE A PERSON 

FtoCeS-9trig: A dictionary entry ia created for PL with sub¬ 
script Pi, The entry for ilu ia (BILL/PERSON) . 

5, Format £ Fl IE THF- PLURAL QE P2 
Example: FEET IE THE FLMRAL OF FOOT 

Processing: P2 ia stored on the property list of PI, after 

the flag 3INC; the word PI is stored on the property list of PZ 
after the flag PLURAL r Thus FEET is stored after PLURAL on the 


a? 






property list of the atom FOOT-. 


6 . Format: FI SOMETIMES MEANS VI 

Example: TWO NUMBERS SOMETIMES MEANS ONE NllMhER AMU THF. 

OTHER HIMBKR. 

Processing: The STUDENT program is modified so that an idiomatic 
substitution of P2 for PI will he made in a problem If it is other¬ 
wise unsolvable. All such "possible Idiomatic substitutions" art- 
tried when necessary, with the last one entered being the first one 
tried. The STUDENT program is modified by the addition of 10111 ' new 
METEOR rules. Since ?1 and P2 are inserted as left and right halves 
of A METEOR rule, they need not contain only words, but can use the 
METEOR elementary patterns to specify a format change instead cf 
just a phrase change. For the example shown, the rules added to th& 
STUDENT program, as listed in Appendix h„ are the ml* labeled 
C02510, the rule following that one, the rule labeled 002511 *nd the 
rule following it. 

7 . Format- PI AT,WAYS MEANS F2 

Example: ONE HALF ALWAYS MEANS 0.5 

Processing: The program STUDENT is mudifind so that if Pi 
occurs, a mandatory substitution of P2 for Fl will be made in any prob- 
l*m. The last sentence in this format processed by REMEMBER will 
ho the fil mandatory substitution made. Thus "one always means 1" 
followed by "one half always means U .5 11 will cause the desired sub¬ 
stitutions to be made; if these sentences were reversed no occurrence 
of "'one half" would ever be found since it would have been changed 
to "1 half", by mandatory substitution of 1 for one. 

For each sentence in this formst processed by BHHUMBHR, * 
new METEOR, rule is added to the STUDENT program, immediately fol¬ 
lowing the rule named IDIOMS, The format of the METEOR rule added 






is (* (PI) (P2) whore Pi and P2 are the strings in the sen~ 

tence processtid* Thus by using a combination of English and METEOR 
elementary patterns and reference numbers in PI and p2„ one tan add 
a new forest of sentence to the STUDENT repertoire. For example* fcih 
following statement was processed by REMEMBER to allow STUDENT to 
"understand"’ (properly tra-ns^orrn) a sentence in which the mein verb 
was "exceeds"; 

{$ EXCEEDS $ BY $ ALWAYS MEANS 1 IS 5 MORE THAN 3) 

This peroianecit ly extended the STUDENT input subset of English, 
while avoiding the necessity of actually editing and changing the 
STUDENT program* 

The global Informs t ion stored for STUDENT ranged from equa¬ 
tions to format changes to plural forms. Again, the compatible use 
of the METEOR prototype notation and the use of the general List 
processing operations in TISP facilitated programming ot' processing, 
alorago and retrieval of this wide range of £nf«rmation. In Appen¬ 
dix C li a listing of the global information currently embodied in 
the STUDENT sySten- 


CHAPTER Vi: SOLUTION OT S J Hill.TA KECKS SQUAO.Q?^ 


This chapter contains a description of the L15F program 
by STUDEKT to solve sets of simultaneous equations* The de¬ 
finitions of the three top level functions SOLVE, SOLVER and SOLVE1 
eee shown in the figure et the end of Oils chapter. Tikis descrip¬ 
tion of these functions is essentially independent of 4 detailed 
knowledge of LI5F* although occasions] parenthetical eivnnente will 
be directed to the mere knowledgeable. 

The top level function* SOLVE, is e function of three argu¬ 
ment a . One, labeled EQT in the definition of SOLVE, is the set of 
equations to he solved. The argument labeled WANTED in the defini¬ 
tion is a List of variables whose values are wanted. The third 
argument, labeled TERMS, is another list of variables whUh is dis¬ 
joint from WANTED* SOLVE, will find the value of any variable which 
is wanted in terms of any or all of the variables on the lifit TERMS, 

In use, the list TERMS la a list of units, such os pounds* or feat, 
which riiay appear in the answer. 

Hie output of SOLVE is dependent bn whether the set of equa¬ 
tions given can be solved for the variable* wanted. If no solution 
can be found because the solot ion involves nonlinear processes, SOLVE 
returns with the value UNSQLVADLE. If no solution is found because 
not enougjh equations arc given, SOLVE returns with the value INSUF¬ 
FICIENT* If however, a solution is found, SOLVE returns with -3 list 
or pairs* The first element of each pair is a variable, either on Che 
wanted list, or a variable whose value was found while solving fox* the 
desired unknowns. The second element of each pair is on arithmetic ex¬ 
press ion (in the prefix notation shown in Figure 2>* which contains 
only numbers and variables on the list TERMS * Thus, the Answer found 
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by SOLVE is an "association Hat” of variables, and Chair values 
in Che proper terms- 

For example. Let ua consider the sat of seven simultaneous 
equations shown below, and suppose SOLVE were asked to solve this 
set of equations for x and a. These see given in infix notation 
for ease of reading. 


(1) 

x + w = S 

<5> 

x -b 2y = C* 

(2) 

r 2 - c; = D 

(6) 

2 

y - 3y + 2 = z 

<3J 

C + ID = 6 

(7) 

r* 

II 

Ps 

a 

(4) 

2C * D - 5 




The list TERMS Is empty, Snd thus Che valuea must all be num¬ 
bers . In this ease ^EOLVE would return with the list of pairs 

>f ((y d 2}(Zt 0}>," which indicates that the values x = 2 and 

z =■ 0 satisfy this set of equations (or those members of this set 
which were used to determine the values). The value y = 1 was 
found during the solving process, 

Most of the work of SOLVE is done by the function SALVER 4 

SOLVE transmits to SOLVER the list of WANTED variables, Che list of 

TERMS, and a null association list (ceiled ALIS) which is recur¬ 
sively built up to give the answer- The value of SOLVER. I $ this as 
socistion list of pairs, with the first element of escEa pair 
being a variable whose value has been found- The second element of 
each is an arithmetic express ion which may contain any variable 
on the List TERMS (as was the case for the AL1S of SOLVE), however 
it may also contain variables which are first elements of pairs 
later on the association list. If values for variables given by 
Idter pairs ere substituted into this arithmetic expression, one 
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gets the arithmetic expression given by SOLVE tOiitdinid^ only 
variables on the list TEEMSr In the example, SOLVER vou Id re¬ 
turn with the association List (£y, (4x-7)) (,e, 0)) which 

gives y in terms of x. SOLVE makes the substitutions and simplifies 
Lion on the association List returned by SOLVER- 

SOL VER is a program, which solves for a list of wanted 
variables. It does this by choosing one of these variables, adding 
the others to the list of terms and calling SOLVE1 to solve for this 
one variable in terms of the other wanted variables and the original 
TERMS. If SOLVE1 succeeds in solving for this variable, SOLVER 
pairs this one variable with the expression found * puts this pair 
on the end of the ALIS, and using this substitution in every equa¬ 
tion it tries to solve, attempts to solve for the remaining wanted 
variables. If there are no more, SOLVER is finished and returns the 
association list built up. 

SOLVEI solves for a single wanted variable by finding an 
equation containing this variable, after all substitutions of 
values for variables listed on the ALIS have been made. It then 
uakee a list of all the other variables in the equation, and checks 
to see if there are any not on the list IEHU&. If so it calls 
SOLVER to solve for these new variables in terms of the wanted 
variable and the variables in TERMS. If SOLVER is unsuccessful, 
SOLVE] tries to find another equation containing the wanted variable 
and repeats the process. If there is none, SOLVE1 has the value 
INSUFFICIENT. If SOLVER is successful, and values for these new 
variables arc found„ or if there were no new variables, SOLVE1 
finally ea]Is SOLVEQ which attempts to solve this equation for the 
wanted variable. If the- equation is linear in this variable, 

SOLVE'*} will be success ful and give -a floli.itton. SOLVRl will add 
a pair consisting of the wanted variable and this value to the end 


of ALI5» and return with this augjnented ALIS as its value i If 
SOLVE*} is unsuccessful, SOLVEI tries another equation, hut then if 
nn solution can be found SOLVEI returns Che value UNSQLVABLE - 

This ■description ha.£ been a rather long-winded attempt to 
explain the one page of LISP program at the end of this chapter. 

Tb make it more specifin* let us consider what happens when SOLVER 
trios to solve the Stt of equations b^Iow fthe same ones shown 
earlier); 


U) 

x + v * 9 

(5) 

x + 2y » 4 

(2) 

2 

X - C = D 

(6) 

y 2 - 3y + 1 - a 

{3) 

C + 3D = 5 

(7) 

4x - y = ? 

(4) 

2C - D = 5 



SOLVER is 

asked to solve 

far x and i. It 

ssks SOLVEI to 


solve for k in terms of *, SOLVEI picks equation (1), finds that 
a new variable, v, has appeared end asks SOLVER to solve for v 
in terms of it and z. Since- there is no other occurrence of w in 
this Set, SOLVER is ufLSutCe5s ful and SOLVE L abandons equation (1) , 
and goes to equation (2)+ Here it calls SOLVER to solve for the 
two new variables C and 0 in terms of x anti z- In this casu 
SOLVER Is successful, using equations (3) and (4), but when these 
values are substituted In equation (2), SOLVEQ cannot solve for i 
because the equation is nut linear in x + 

SOLVEI now abandons equation (2) and the results it obtained 
as aiibgoals for solving (2). It finds an occurrence ef x again 
in (3) ■ Again it calls on SOLVER, to solve for the new variable 
X in terns of x and * + SOLVER tries to use (6>„ but SOLVEQ cannot 
solve this equation for Using (7) SOLVER returns with an ALIS 
°f CCy^C^X - ?)))■■ Using this ALIS, substituting this value for y 
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into (5), SOLVE?! calls -on EOLVEQ to solve this equation for x, 
which it dots, and finally SOLVE1 returns to SALVER the ALIS 
((y,. (4 k - which docs the value of x in terms 

of z_* Having found x in terms of z , SOLVER will now call SOLVE! 
to find the value of s. SOLVE1 finds an occurrence of z In 
equation (6}> and after substitution of terms on the ALI5, SOLVER 
is able to solve this equation for e, because it is linear in i. 
Adding the pair U,Q) to the ALIS, SOLVEl returns it to SOLVER, 
which on this ALIS (£y, (4k - 7}), s (a T 0)J to SOLVE. 

SOLVE, using. the function SUEORJP, which sub stitutes in ord er 
pairs on an ALlS into an expression and simplifies,, finally returns 
the ALIS C(y 1 l)CKp2>(z J D)). 

This anplfi shows the rather fcorCubus recursions that these 
functions use to solve a set of equations* Khy should we use this 
type of solving program instead of a more Sfcrai^htforward matrix 
method? The principal reason is that, as shown, nonlinear equations 
may appear in the set. In this case, if appropriate values c^n be 
found from other equations which 'when Substituted into this non* 
linear equation make? it linear in the variable for which we went to 
solve, then SOLVE will find the value of this variable, 

Thft method of operation of 5ALVEK requires that if n vari¬ 
ables appear in any equation, and that equation is used, then at 
least n-1 other independent equations containing these variables muat 
be in the set of equations, or the actual mechanics of solving will 
not be started. This eliminates much work if there are extraneous 
equations in the set which contain one or two of the wanted variables 
However, it precludes solving a set of equations which is homo¬ 
geneous in one unwanted variable, end would therefore cancel out 
in the solution process. This is the principal reason why problems 
s uch v s ; 
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^Spigot A fills a tufe in L hour, and. spigot E in 2 
hour*. "How long, do they take together^" 

cannot be solved by STUDENT. 

tliia solving subroutine set is an indepccwlent package in the 
STUDENT program. Therefore, improvements can be made to if without 
disturblog the rest of the processing. The routine described 
here was designed to handle most of the problems that can ho found in 
first year algebra texts. 


,: J r - 


CEAFTEK VLl; CQHCLUSION 


A- Bcsults. 

The purpose of the research reported here Co develop 
techniques ■which fee £ 1itate natural language COfrmunication u(.cli 
a computer. A fttmantic theory of Coherent d is course was proposed 
as a basis for the design and understanding of such man-machine 
systems. This theory was only outlined* and much addition*! wort 
remains to ho done. However, in its present rough form, the 
theory served as 0 guide for construction of the STUDENT system, 
which can cwimunioate in a limited subset of English. 

The language analysis in STUDENT ie an implementfttton of the 
analytic portion of this theory* The STUDENT system has a very 
narrow semantic base. From the theory it is clear Chet by utilising 
thus knowledge of the Limited tenge of meaning of the input discourse, 
the parsing problem becomes greatly simplified, since the number of 
linguistic forces that must be recognized is very small. If * 
parsing system were based on any small semantic base* this same sim¬ 
plification would occur. This suggests that in a general language 
processor, some time might he tpent putting the input into a semantic 
content before going ahead with the syntactic analysis* 


Hie semantic base of the STUDENT language analysis is delimited 
by the characteristies of the problem solving system embedded in it. 
STUDENT is a question-answering system which answers questions posed 
in the context of ''algebra story problems*" In the introduction, 
we used four criteria for evaluating several question-answering sys¬ 
tems. Let us compare the STUDENT system to these others in the Light 
of these criteria. 
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1) Extent of Understanding* All the other qu cat ion- fh- 
Sw&ring svatens discussed analyse input sentence by sentence. 

Although a representation of the meaning of all input sentences 
m;iy he placed in Some common store, no syntactic connection is 
ever made between sentences. 

In the STUDENT system, an acceptatie input is * sequence of 
sentences h such that these sentences cannot be understood by Just 
finding the meanings of the individual sentences, ignoring their 
local content. Inter-sentence dependencies mnet be determined, and 
Inter-sentefice syntactic relationships must be used in this case for 
solution of the problem given. This extension of the syntactic 
dimension of understanding is important because such inter-sentence 
dependencies (e.g.pthe use of pronouns) Are very ccnranly used in 
natural language communication. 

Hie semantic ttkjJoI in the STUDENT system ia based on one 
relationship (equality) and five basic arithmetic functions. Com¬ 
position of these functions yield other functions which are also 
expressed as individual linguistic forms in the input language. 

The input language is richer in expressing functions than Lindsay's 
0 r Raphael '3 system. To* logical system discussed may have more 
relationships (predicates) allowable in the input, but dp not allow 
any composition of these predicates* The logical combinations 
pf predicates used are only those expressed in the input as logical 
combinations (using and, or, etc.). 

The deductive system In STUDENT, as in Lindsay's and Raphael's 
programs, is deaigned lor the type of question* to he salted. It 
can only deduce aniwers of a certain type from, the input information, 
that I*, arithmetic values satisfying a set of equations. In per¬ 
forming its deductions it Is reasonably sophisticated in avoidiog 
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Irrelevant information,, as are- the other two mentioned. It lacks 
the general power of a logical system, but is much more efficient 
in obtaining its particular class of deductions than would be a 
general deductive system utilizing the axioms of arithmetic. 


2) Facility for Extending Abilities. Extending the Syntactic 
abilities of any of the other quest ion-answering systems discussed 
■would require reprogramming. In the STUDENT -System new definitional 
brans fortes Cions can be introduced at run time without any reprogram¬ 
ming. Ifse Information concerning these transformations can he in¬ 
put in English;, qr in a combipstion of English und METEOR, if that is 
more appropriate. New syntactic trans formations must be added by 
extending the program. 

The semantic base of the STUDENT System can be extended only 
by adding new program, as is trot of the other question-answering 
systems discussed. Howevar STUDENT is organised to facilitate 
such extensions, by minimizing the interactions of different parts 
of the program. The necessary information need only be added to the 
program equivalent of the table of operators in Figure 4, in Chap¬ 
ter IV ♦ 


Similarly, the deductive portion of STUDENT , which solves the 
derived Set of equation*» i* an independent package. Therefore, a 
ti&W extended solver can bo added to the system by just replacing 
the package + and maintaining the input-output characteristics of 
this subroutine* 


j'i Kn.ovl erl v c _of Interne! Structure tleeded by User - Very 
11 tda if any internal knowledge of the workings of the STUDENT 
system need be known by the user. Be must have a firm grasp of the 
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type of problem that 5TUDFNT can solve* and a knowledge of the input 
grammar. For example* tie must be aware that the eai.e phrase must 
always be used to represent the same -variable in a problem, within 
the limits of similarity defined earlier. He oust realize that 
even within these limits STTmRNT will not recognize more than one 
variation on 0 phrase, But if the V3er does forget any of these 
facts, he can still use the .system, for the Interaction discussed 
in the next Section allows him to make amends for almost any mistake. 


4) Interaction With the User. The STI.JCEKT system is embedded 
in a time-sharing environment (the KIT Project MAC time-sharing, 
system (13)), and- this greatly facilitates interaction with the 

user. STUDENT differentiates between it6 failure to solve e 
problem because of its cnathcmatits 1 limitations and failure from 
lack of sufficient informalion, la case of failure it asks the uaer 
for additional information, and Suggest# the nature of the needed 
information (relationships among variables of the problem). It 

can. go back to the user repeatedly for Information until it haa 
enough to solve the problem, or until the user gives up, 

STUDEOSfT also reports when it does not recognize the format of 
an input sentence- Using this information as a guide* the user is 
in * teaching-machine type situation, and tan quickly learn to speak 
STUBENT’3 brand of input English. By monitoring the assumpLions 
that SmiENT makes about the input, and the global information it 

uses, the user can stop the system and reword a problem to avoid 
an unwanted ambiguity, or add new general Information to the 
global Information store. 

The crucial point in this user interaction is that STUDENT is 
embedded in an on-Line time-sherlag system, and can thy.fi provide more 
interaction than any of the other systems mentioned. 
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E - Ext ens io ns . 


Tns present STUDEKT system has reached the maximum size allow¬ 
able in the USP system on a thirty-tws thousand word IBM 7094, There¬ 
fore, very little can be added directly to the present system. All 
the programming extensions mentioned here ere predicated on the 
existence of a much larger memory machine. 

Without inventing any new techniques, I think that the STITDEST 
system could be made to understand most of the algebra story prob¬ 
lems that appesr in first year high School text bouts♦ If new 
operators, ne^ Combinations of arithmetic operations occur, they 
can easily b« added to OEFOrh, the subroutine which maps the kernel 
English sentences into equations. The number of ferrets recog¬ 
nizable in the system can be increased without reprogram!ng 
through the machinery available for storing global information 
(this was discussed in more detail in Chapter V), The problems it 
would not handle are those having excessive verbiage or implied 
information about the world not expressible in a single sentence, 

As mentioned earlier, the system can now make use of any given 
schema only tmCc in solving a problem. This is because the schema 
equation is added Co the set of equations to bs Solved* and the vari“ 
ables to the schema only identified with one other set of vari¬ 
ables appearing in the problem, For example, if "distance equals 
speed times time 11 were the schema, then "distance 11 , as a variable 
in the schema might be set equal to "distance traveled by train" 
or "distance traveled by plane", but not both in the same probUn, 

This problem could be resolved by not adding the schema equation 
directly to the Set of equations to be solved, but by looking for 
consistent sets of variables to identify with the schema variables. 

Ibien iruiJKHT could add an instance of the scheiia equations, with the 
appropriate substitutions, for each consistent set <a£ variables 
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found which are "similar" to the schema variables. 


At the moment the striving subroutine of STUDENT can only per¬ 
form linear operations on literal equnt ion-S b arid substitution^ of 
numbers in polynomials and exponentials. It would be relatively 
easy to add the facility for solving quadratic or even higher order 
Solvable equations. One could even add, quite easily^ sufficient 
mechanisms to allow the solver to perform the differentiation needed 
to do related rate problem* in the differentia], calculus. 

the semantic base of the STUDENT system could be expanded. In 
order to add the relations recognized by the SIR system, of Raphael, 
for example, one would have to add on the lowest level of the STUDENT 
program* the set of kernel sentences understood in SIR, their mapping 
to the SIR model, and the quest ion-answering routine to retrieve 
facts. Then the apparatus of the STUDENT system would process much 
more complicated input statements for the SIR model. One serious 
problem which arises when the semantic b-i.se is extended is based on 
the fact that one kernel may have an interpretation in terms of two 
different semantic bases* for example, "Tom has 3 fish. 1 * can 
be interpreted in both SIR and the present STUDENT system. To 
resolve this semantic ambiguity, the program can check the context 
of the ambiguous statement to flee if there has been one consistent 
model into which all the Other statements have been processed. If 
the latter condition doe# not determine a single preferred inter¬ 
pretation for the statement, then both interpretations can bo stored. 

In addition to these imcdlate extensions of the STUDENT system, 
our semantic theory of discourse can be used as a hasis far a isuch 
inure general language processing system. As a start, one could 
implement the- generative grammar described, in Appendix E to produce 
coherent discourse—problems solvable by the STUDENT system. 


tftz 


Another more exciting possibility ie to utilize. this type of speak¬ 
er's model of the world to attack Yngve'e "baseball announcer" prob¬ 
lem. The baseball announcer has certain propositions added to his 
world model from the events he perceives, i.e. the baseball game he 
is watching. Mandatory application of certain semantic; rules odd 
other propositions , and delete sotae that are there* While Cheae 
changes are going on„ the announcer is tu generate a running com¬ 
mentary (coherent discourse} describing this hall game he is watch¬ 
ing. Ey making the proper assumptions about Where the attention 
of the announcer is focused , that is , which prnpoSitions he is 
going to use as a base of his discourse at any time* I feel that a 
reasonable facsimile of an announcer can be programmed* This is. 
of Course, on empirically testable hypothesis * 

Another use for this inode 1 for generation and analysis of 
discourse is as a hypothesis about the linguistic behaviour of 
people. Psycholegists have built reasonable computer models for 
human behaviour in decision making (17), verbal learning of nonsense 
Syllables (15), and some problem solving situations (34). STUDENT 
may be a good predictive model for the behaviour of people when con¬ 
fronted with an algebra problem to solve* This- can be tested, and 
such a study nay lead to a better understanding of human behaviour, 
and/or a better reformulation of this theory q£ language processing. 

I think we are far from writing a program which can understand 
all, nr even a very large segment of English. However, within its 
narrow field of competence, STUDENT has demonstrated, that "under¬ 
standing" machines can be built* Indeed, 1 believe that using the 
techniques developed in this research, one could construct a system 
of practical value which would e®nim unit# fee well with people in En¬ 
glish over the range of material understood by the program. 
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APPEKlXlX E; LISTING OF T11E STUPEMI PKOGEAK 
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AFP EBP IX Ci UDltiAl, INFORMATION IK STUDENT 


REHEMB£(l( ( 

{PEOPLE JS THE PLURAL OF ptRSChl} 

(FEET 15 THE FLUJUL OF FUGT) 

(YARDS FS THE PLURAL OF YARD) 

(FATHOMS 15 THE PLURAL OF FATHOM) 

(Inches is the plural of inch; 

(STARS IS THE PLURAL OF SPAN) 

(ONE HALF ALWAYS MEANS -0.5 } 

(THREE NUMBERS ALWAYS MEANS THE FIRST NUMBER ANO THE SECOND 
number and the third wumber) 

(FIRST TWO NUMBERS ALWAYS MEANS 

THE FIRST NljMSEA AND THE SECOfcD NUM3EAJ 

(MORE THAN ALWAYS MEANS PLUS) 

(THESE ALWAYS MEANS THE} 

(TNQ NUMBERS SOMETIMES MEANS DNE NUMBER AND THE 
OTHER NUMBER} 

(TWO NUMBERS SOMETIMES MEANS UNE OF THE 

numbers an® the other number) 

(MAS IS A YEAS} 

(GETS IS A VERB} 

(.HAVE IS A VERB) 

(LESS THAN ALWAYS MEANS LEsSTHArf) 

(LESSTHAN 3S AN OPERATOR OF LEVEL 2) 

(PERCENT ]S AN OPERATOR DF LEVEL 2 } 

(PERCENT LESS Than ALWAYS MEANS PERLESS) 

(PEKLES5 15 AN OPERATOR OF LEVEL 2J 
(PLUS IS AN OPERATOR OF LEVEL 2) 

(SUM LS AN OPERATOR} 

(TIMES IS AN OPERATOR OF LEVEL 1} 

(SQUARE IS AN OPERATOR OF LEVEL 1) 

(OSVDY )S AN OPERATOR OF LEVEL D 
(OF IS AN OPERATOR} 

(DIFFERENCE IS AN OPERATOR) 

(SQUARED IS AH QPEflATOR) 

(MINUS IS AN OPERATOR OF LEVEL J} 

(PER LS AN OPERATOR} 

(SQUARED JS AN OPERATOR) 

(YEARS OLDER THAN ALWAYS MEANS PLUS) 

(TEARS YOUNGER THAN ALWAYS MEANS LESS THArO 
(LS EQUAL TO ALWAYS WEANS IS} 

(PLUSS IS AN OPERATOR} 

(MlNySS IS AH OPERATOR) 

(HOW OLD ALWAYS MEANS MHAT3 

(THE PERIMETER OF Si RECTANGLE SOMETIMES MEANS 

TWICE THE sup. OF THE LENGTH AND- WIDTH OF THE RECTANGLE) 

(GALLONS LS THE PLURAL OF GALLON) 

(HOURS LS THE PLURAL OF HOUR) 

(MARY JS A PERSON} 

(AHN IS A PERSON} 

(BILL IS A PERSON} 

(A FATHER l£ A PERSON) 

(AN UNCLE IS A PERSON) 

(POUNDS IS THE PLURAL OF POUND} 

(WEIGHS IS A VERB} 

93 

REMEMBER ({ 

(51 STANCE EQUALS SPEtD TIMES TINE1 
(distance equals cas consumption times 

NUMBER OF GALLONS OF GAS USED) 

(1 FOOT EQUALS 22 INCHES} 

(L YARD EQUALS i FEET) 

J } 
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APPENDIX b:: a jiJ-lriLI. SEMANTIC GRNKBAT[VK GRAMMAR 


The grammar Ou11 tried here will generate only word problems 
solvable by STUPKNIh though not the set of all such problems. 


ROLES 

EXAMPLES 

Create a set of Slmi.ittene- 

2x + 3y “ 7 

ous equations which can he 

y - im 

solved by strictly tin tier tech- 

2 

y + Jt “ & 

Cliques, eKcept that substitu- 


tion of numerical values in 


higher order equations which 


reduce them to linear L-qua- 


Clone ia allowed. These are 


the propositions of the speak.- 


e r r s nod el* 


Choose unknowns for which 

Z m ? 


STUDENT is to solve- This Is 


the question. 


Choose unique names for 

x “ first number 

variables without articles ,r a , % 

y ■» second number Tom chose 

,r an' r J , or lf the M * In the prob¬ 

z = third number 

lem any of these articles may 


he iised at any occurrence of 


a name, In a complete mod*! 


these names would be usiyCl- 


ated with the objects in the 


chosen propositions. 


Write one kernel sen¬ 

"2 times the first number 

tence for each equation- Use 

plus three times the second 

any appropriate linguistic form 

number Tom chose is 7. The 

given in the tablv below to 

second number Tom chose 
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represent the arithmetic 
functions in the equa¬ 
tion f 

tor each unknown whose 
value is to be found, use 
a kernel sentence of the 
form; 

Find __ 

Wh d t is _ 

or Find ____ and _ 

What are _ and _____ 

for more than one such un¬ 
known + 

If d name appears more 
than once in a problem t some 
(or all) occurrences after 
the first may he replaced 
by a "similar 1 ' name. Simi¬ 
lar names are obtained by 
Ctensformations which; 

a) insert a pronoun 
for a noun phrase 
In the name. 

b) delete Initial and/ 
or terminal sub¬ 
strings of the name. 

Only one such "similar"' string 
can be uied to replace an oc¬ 
currence of a name „ though 
any number of replacements 
can bn mad c„ 


equals ,5 of the first 
number* 

The sum of the second num¬ 
ber Ton chose and a third 
number is equal to the 
square of the first num¬ 
ber* What is the third 
number ? m 


Similar names: 

"first" for "first nunber" 
"escand number he chose" 
fot "second number Tom 
chose" 
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If occurs in S: and 
and in Sj it ia the 
ftntifft substring to the left 
■of rr ls"j "equals' 1 or M is 
equal to" {or the entire 
substring to the- right) then 
in S may be. replaced 

by any phrase containing the 
word "this". 


Any phrase may he 
replaced by another phrase 
which mean* Cho same 
thing. This would moan that 
STUDENT had been told of this 


equivalence using REMEMBER 

and the sentence "'F^ alwaya 

meana P,” or "F- sometimes 
i r 

means P^' r * 

Two conSeCwtlye sen¬ 


tence# may be connected by 
replacing the period after 
the first by 11 , and". A 
sentence can be connected 
to e question by preceding 
the, sentence by "If" and 
replacing the period at 
the end of the sentence by 

IT pr 
^ * 


Replace "the second number 
Tom chose" by "this second 
choice" in the third sen¬ 
tence ■ 


Replace "2 times'' hy "twice" 
and ".5" by "one half". 


Connect sentences 1 and 1 y and 
sentence J and the final ques¬ 
tion to give: 

“Twice the first number plus 
three times the Second 
number Tom cboSe is 7> and 
the second number ho chose 
is one half of the first. 

If the sum of this second 
choice and u third number 
is equal to the square ef 
the first number, what is 
the third number7" 
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S urnmary of Lingj i b t ic Forms to Express Arithmetic Functions 

and the Mqyia 1 It y Egj L gj lpn 


x = y k is yi k equals yj x is cqu,al to y 

x + y x plus yj the sum of x arwl y; x more than y 

x - y x minus yj the difference between x and y; 

y less than x 

x * y x times y; x multiplied by y; x of y -(if x 

is a number) 

x / y x divided by y} x per y 
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